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THEME 


Barriers  to  information  transfer  are  a  significant  factor  in  the  aerospace  and  defence  R  &  D  community.  This 
community  most  often  uses  the  scientific  and  technical  information  of  its  own  nation,  although  the  literature  originates 
worldwide.  Understanding  these  barriers  will  help  identify  the  best  applications  of  information  technology  for  barrier 
reduction. 

The  programme  sought  first  to  define  the  magnitude  of  the  problem  in  terms  of  aerospace  and  defence  information¬ 
seeking  behaviour,  and  to  identify  the  language,  culture  and  economic  barriers  to  information  transfer. 

The  next  sessions  explored  the  man-machine  issues,  such  as  the  viability  of  computer-assisted  translation,  and  machine 
and  data  issues,  such  as  the  availability  and  need  for  standards.  The  status  and  potential  application  of  information 
technology  in  these  areas  was  also  discussed. 


*  *  * 


Les  barrieres  au  transfert  des  informations  dans  la  Communaute  Aerospatiale  de  Recherche  et  de  Developpement 
posent  un  reel  probleme  car  cette  communaute  utilise  le  plus  souvent  les  informations  scientifiqucs  et  techniques  de  son 
pays;  bien  que  la  litterature  technique  soit  universelle. 

La  comprehension  de  ces  barrieres  permeitia  de  determiner  les  meilleures  applications  aux  technologies  d'information 
pour  les  reduire. 

Le  programme  a  cherche,  en  premier,  a  definir  l'amplitude  du  probleme  parmi  les  choi-cheurs  d'information  dans  les 
domaines  aerospatiales  et  de  defense  et  a  identifier  les  barrieres  economiques,  de  langage  et  culturelles  au  transfert  des 
informations. 

Les  sessions  suivantes  ont  explore  les  problemes  homme-machine  telle  que  la  viabilite  de  I'assistance  des  computeurs 
pour  traductions',  et  les  problemes  de  machines  et  de  donnees  telle  que  la  disponibilite  et  la  necessitc  de  standards.  L'etat 
present  et  le  potentiel  de  l  application  des  technologies  pour  1'information  dans  ces  domaines  ont  ete  aussi  discutes. 
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BARRIERS  TO  THE  INTERNATIONAL  TRANSFER  OF  INFORMATION  IN 
AEROSPACE  AND  DEFENSE 


by 

James  J. Harford 
Barbara  Lawrence 

American  Institute  of  Aeronautics  and  Astronautics 
.'70  L'Enfant  Promenade.  SW 
Washington,  D.C.  20024 


INTRODUCTION 

Let's  take  our  keynote  for  this  meeting  from  a  recent  study  of  the  US  National  Academy  of  Engineering  (N  AEj.  entitled 
"Strengthening  US  Engineering  Through  International  Cooperation",  ( 1 ): 

"The  world  in  which  US  engineers  and  technologists  learn  and  practice  their  profession  is  changing  more  rapidly  than  the 
institutions,  policies,  and  programs  intended  to  ensure  our  future  economic  growth,  security,  and  welfare.  The  United 
States  and  especially  its  engineering  and  science  community  needs  to  adopt  new  attitudes  and  strategies  if  we  are  to 
maintain  or  enhance  our  industrial  health  and  standard  of  living  in  the  face  of  the  reality  of  intense  international  economic 
competition.  Responses  must  be  political,  economic,  and  cultural,  as  well  as  technological.  Although  improvements  in  the 
competitive  status  of  the  United  States  will  not  come  about  solely  as  a  result  of  our  being  more  aware  of  technological 
progress  made  outside  the  United  States,  technological  isolation  will  surely  undermine  the  future  of  our  industrial  and 
educational  institutions." 

The  United  States  may  not  be  the  only  NATO  nation  with  isolationist  technological  tendencies,  but  it  probably  has  those 
tendencies  to  a  greater  degree  than  most  others,  because  for  so  long  v.  e  have  been  the  undisputed  technological  leader,  while  we 
have  been  the  most  backward  linguistically. 

Professional  societies  have  a  special  role  in  correcting  this  most  unsatisfactory  situation.  Indeed,  as  organizations 
chartered  to  promote  information  exchange,  we  have  a  deep  obligation  to  help  import  information,  and  encourage 
international  interactions  and  awareness,  to  a  much  greater  degree  than  in  the  past.  To  do  this  we  should  first  understand  the 
barriers  to  the  global  flow  of  information. 

This  morning  I  II  be  talking  about  how  international  information  transfer  occurs  in  aerospace  and  defense  and  I  will  try  to 
characterize  the  barriers  specific  to  aerospace  and  defence.  T  hen  I  will  challenge  you  to  participate  in  the  main  mission  of  this 
meeting  —  the  development  of  an  action  program  for  both  AGARD  and  your  own  organizations  that  will  reduce  the  barriers. 

ROLE  OF  A  PROFESSIONAL  SOCIETY 

AIAA  has  taken  to  heart  the  recommendation  of  the  above-mentioned  NAE  study  that  "US-based  engineering  and 
professional  societies  should  continue  to  develop  their  international  activities  for  the  benefit  of  both  the  US  and  international 
engineering  and  technology  enterprise".  (2) 

We  are  the  oldest  and  largest  professional  society  of  aerospace  scientists  and  engineers  in  the  world.  Currenllv  we  have 
40.000  members.  Although  only  5%  are  foreign,  the  percentage  is  growing  strongly.  Our  information  transfer  activities  fall  into 
three  areas  —  meetings,  publications,  and  education. 

AIAA  holds  more  than  25  technical  meetings  each  year  and  assists  in  organizing  several  international  conferences  |e.g. 
the  annual  congresses  of  the  International  Astronautical  Federation  (1AF).  and  the  biennial  International  Congress  of  the 
Aeronautical  Sciences  (ICAS)|. 

Through  the  good  work  of  members  of  our  Technical  Committees  efforts  to  increase  international  participation  in  our 
domestic  meetings  have  been  quite  successful.  Last  year's  Aerospace  Sciences  meeting,  for  example,  had  1 1%  non-US  papers, 
the  Communications  Satellite  Systems  Conference  had  42%.  and  the  Structures.  Structural  Dynamics  and  Materials  meeting 
had  10%. 

Even  the  balance  of  international  participation  in  AIAA  meetings  has  changed.  Earlier.  English-speaking  Western 
European  nations  dominated.  Now  our  meetings  regularly  include  papers  from  the  USSR.China.  Latin  America,  and  above  all 
Japan.  The  increase  in  Japanese  participation  is  dramatic.  In  the  1 980  Aerospace  Sciences  meeting  there  were  no  Japanese 
papers.  In  1987, 20% of  the  foreign  papers  were  from  Japan. 

AIAA's  Publications  Committee  has  also  taken  up  the  call  to  do  its  part  to  improve  the  diffusion  of  international 
information.  Aerospace  America.  AIAA's  membership  magazine,  helps  readers  maintain  currency  through  monthly  reporting 
on  aerospace  trends  in  Europe,  Japan  and  the  Soviet  Union. 

One  journal,  the  Journal  of  Aircraft,  has  an  international  board  of  editors  whose  primary  role  is  to  solicit  non-US 
manuscripts.  The  Publications  Committee  recently  voted  to  encourage  the  editors-in-chief  of  the  other  AIAA  journals  to  do 
likewise. 
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Soviet  Abstracts  are  a  new  feature  in  the  basic  research  journal,  the  AIAA  Journal.  Mach  month  the  Journal  publishes  a 
minibibliography  of  abstracts  of  recent  Soviet  work  on  some  timely  topic. 

AlAA's  NASA-supported  secondary  information  service  publishes  the  bi-weekly  International  Aerospace  Abstracts 
(IAA)  and  markets  the  Aerospace  Database,  a  major  information  resource  for  international  literature  in  aerospace  and  related 
technologies.  The  sources  analysed  for  these  services  come  from  over  1 00  countries  and  are  in  30  languages.  Since  one  half  of 
the  aerospace  literature  information  originates  outside  the  US,  it  is  clear  that  the  need  to  he  aware  of  world*  ide  literature  is  not 
an  idle  notion. 

AIAA  is  experimenting  with  computer-based  continuing  education,  coordinating  homework  on  PCs  with  electronic  mail 
and  teleeducation  eliminates  the  need  for  travel,  and  opens  the  door  to  international  participation  —  both  for  students  and 
teachers. 

MOTIVES  FOR  INTERNATIONAL  COVERAGE 

Knowing  the  motives  for  such  efforts  at  international  coverage  may  help  us  later  w  hen  we  address  w  ays  to  break  down  the 
barriers.  Wallerstein.  in  a  National  Research  Council  study  entitled  "Scientific  and  Technical  Cooperation  Among 
Industrialized  Nations"  (3),  identified  several  motives  for  international  cooperation: 

Cost  Sharing  —  In  the  current  economy,  this  is  perhaps  the  most  pressing  argument  for  international  cooperation.  Dan 
Greenberg,  writing  about  funding  for  the  superconducting  supercollider  says,  “since  high  energy  physics  is  an  open. 

fundamental  field  of  science . the  supercollider  should  be  internationalized  and  its  costs  spread  among  a  consortium  of 

scientifically  advanced  nations."  (4)  Don  Fink,  editor  of  Aviation  Week,  comes  to  the  same  conclusion  writing  about  the 
recent  cancellation  of  the  Israeli  fighter  plane.  "The  l.avi  cancellation  cannot  be  considered  a  strictly  Israeli  problem.  It 
also  must  be  looked  upon  as  the  wave  of  the  future  for  all  the  Free  World  nations.  The  financial  realities  of  the  modern 
world  could  well  mean  that  international  cooperative  efforts  are  the  only  practical  wav  their  military  requirements  can  be 
met."  (3) 

(  oncept  Development  —  formal  exploitation  of  the  in\  isible  college 
Enhancement  of  Scientific  amt  Engineering  (  ompetencc 

Political  (  onsideraiions  —  enhance  alliances,  assist  developing  nations,  gain  prestige 
The  other  objectives  seem  to  have  relevance  now : 

Defeating  the  "Sot-lnventeil-l  lere"  t  Sill)  <i»  %vn  or  the  mrrojam  know  it  ?"  "iew 

Supporting  Development  of  Sew  Technological  Industries  —  as  the  MIX'  has  done,  when  it  is  clear  that,  particularly  for 
information  technology  based  products,  sticking  to  national  boundaries  means  markets  that  are  too  small. 

Let  us  remember  these  motives,  and  consider  them  when  we  seek  solutions  to  reducing  the  international  information 
transfer  bnrneis. 

HOW  INTERNATIONAL  INFORMATION  TRANSFER  OCCURS 

Some  avenues  of  information  transfer  are  specific  to  the  concept  international:  others  are  generic. 

The  aerospace  and  defense  world  is  primarily  a  community  of  engineers.  And  engineers  are  notorious  for  preferring  the 
route  of  informal  communication  above  all.  To  an  engineer  the  maxim  that  “the  accessibility  of  information  is  more  important 
than  the  perceived  value"  (7)  is  in  truth  a  dictum.  Invisible  colleges  are  real  institutions,  and  the  telephone  stretches  their 
boundaries.  Indeed,  electronic  mail  networks,  such  as  ARPANET,  function  as  the  infrastructure  of  the  invisible  college  in  its 
open  version. 

One  way  to  join  the  informal  network,  the  invisible  college,  is  by  attending  meetings  and  sy  mposia.  Various  international 
conferences  provide  a  significant  channel  of  international  information  exchange,  for  example  the  previously  mentioned 
International  Astronautical  Federation  (IAF)  and  International  Congress  of  the  Aeronautical  Sciences  (ICAS).  as  well  as  the 
Committee  on  Data  for  Science  and  Technology  (CODATA). 

The  final  generic  information  transfer  avenue  is,  of  course,  the  literature.  In  aerospace  and  defense  the  important  primary 
literature  includes  both  published  literature  and  technical  report  literature.  The  NASA  and  AIAA  programs  emphasize 
international  exchange  agreements  to  ensure  building  the  most  comprehensive  aerospace  information  resources.  Thh 
audience  needs  no  further  details,  but  I  will  remind  you  that  the  very  existence  of  the  NATO  AGARD  Technical  Information 
Panel  (TIP)  attests  to  the  multinational  original  of  the  literature. 

Academic  exchanges  can  be  used  to  foster  international  information  flow.  Other  nations  use  this  route.  In  the  US.  in  fact, 
well  over  50%  of  engineering  graduate  students  are  foreign.  (8) 

Sabbaticals  and  post-doctoral  fellowships  are  the  traditional  means  of  academic  exchange.  However,  in  the  US  the 


opportunities  have  decreased  significantly.  In  the  early  1960s  more  than  50%  of  National  Science  Foundation  (NSF) 
postdoctoral  fellows  went  to  foreign  institutions.  By  the  early  1980s  the  absolute  number  had  dropped  to  one  fifth,  and  only 
10%  of  the  fellows  went  to  foreign  institutions. 

Academic  exchanges  rate  high  on  the  list  of  methods  of  international  cooperation  among  NSF  engineering  program 
managers.  They  also  rate  high  within  NATO.  Since  the  beginning  of  the  1960s  the  NATO  Science  Fellowship  program  has 
supported  1 50,000  scientists  and  engineers  in  academic  exchanges  throughout  the  Alliance.  (9) 

Thus,  international  organizations  play  an  important  role  in  facilitating  international  information  transfer.  The  NATO 
science  program  is  certainly  a  leader.  NATO  has  not  only  the  Fellowship  program,  but  also  the  Research  Collaborative  Grants 
program,  and  the  very  special  Advanced  Study  Institutes. 

AGARD,  with  its  subject  expert  panel  structure  provides  extended  opportunities  for  formal  and  informal  exchange. 

Bilateral  programs  are  still  another  cross  national  channel  for  information  transfer.  For  example,  the  US  has  used  bilateral 
arrangements  between  NASA  and  ESA,  which  includes  the  technical  information  exchange  arrangements  that  provide  reports 
input  to  the  abstract  publication.  STAR. 

Lastly,  industry  should  not  be  forgotten  in  a  reveiw  of  international  exchange.  Some  companies  themselves  are 
multinational,  with  research  and  development  centers  around  the  world;  IBM  being  perhaps  the  premier  example.  Companies 
may  license  technology  from  elsewhere.  US  companies  have  licensed  Soviet  welding  technology.  Also,  industry  may  engage  in 
joint  ventures.  For  instance.  Boeing  has  joined  with  both  Japanese  and  Italian  companies  to  manufacture  the  767  transport 
aircraft. 

TYPES  OF  BARRIERS 

With  that  as  a  background,  let  us  review  the  inhibitors  to  information  exchange  —  the  barriers  —  in  preparation  for  their 
detailed  discussion  during  the  next  two  days.  The  first  are  cultural  barriers,  including  the  aforementioned  Not-Invented-Here 
Syndrome,  or  technological  arrogance.  These  seem  to  stem  from  the  societal  or  organizational  mores  of  some  scientists  and 
engineers  who  simply  don't  take  into  account  the  foreign  literature,  perhaps  because  of  a  lack  of  education  in  the  use  and  value 
of  information. 

A  second  major  barrier  is  economic  —  an  alleged  defense  against  competitors.  This  can.  and  often  does,  backfire,  as  it  has 
for  the  US  semiconductor  industry. 

A  third  barrier  is  language  —  already  referred  to  and  certainly  the  most  obvious  challenge  to  a  global  community.  J. Michel 
(10)  has  said  that  in  science  and  technology  there  are  only  five  languages  of  significance  —  English.  Russian.  German.  French, 
and  Japanese.  In  aerospace  we  should  now  add  a  sixth  language,  Chinese.  Van  Bergeijk  ( 1 1 )  of  the  International  Translation 
Center  said,  in  I  "80,  that  onlv  25— 30%  of  important  literature  is  translated.  We  will  hear  more  about  the  problems  and  the 
proposed  solutions  regarding  language  barriers  in  the  papers  that  follow. 

Political  barriers  often  result  from  national  pride  —  we’ll  do  it  ourselves  —  or  as  in  the  case  of  the  Israeli  Lavi  fighter,  we 
can  do  anything  —  or  from  the  desire  to  protect  national  security.  I  won’t  belabor  these  isuses.  but  wish  to  quote  from  the 
testimony  of  Lew  Allen  in  presenting  the  study  "Balancing  the  National  Interest:  US  National  Security  Export  G'r”r~l> 
Global  Economic  Competition".  ( 1 2) 

"...  the  US  must  continue  to  encourage  vigorous  technology  advance  in  this  country,  and  indeed  among  our  Western  allies 

as  well.  Economic  vitality  generally,  and  economic  strength  in  the  high-technology  sector  particularly,  is  important  for 

national  security." 

It  sometimes  seems  that  the  more  we  develop  and  depend  on  the  use  of  technology  to  support  information  transfer,  the 
more  barriers  we  create.  Questions  of  access,  training,  cost,  and  standards  will  also  be  discussed  in  depth  later  in  this 
symposium. 

AEROSPACE  AND  DEFENSE  NATURE  OF  INFORMATION  BEHAVIOR 

What  makes  us  think  that  the  aerospace  community  needs  to  make  a  special  efiort  at  reducing  the  barriers  to  international 
information  transfer?  There  is  some  evidence  that  the  nature  of  the  large  and  complex  and  often  sensitive  projects  in  this 
industry  contribute  to  the  narrowing  of  visible  horizons,  and  the  loss  of  perceived  need  to  look  beyond  the  current  program  and 
colleagues  at  hand. 

Building  a  missile,  an  airplane,  or  a  space  station  requires  several  thousand  engineers,  all  working  for  several  years  against 
predefined  goals  and  specifications.  There  is  a  can't-fail  attitude  that  may  limit  risk  taking.  In  this  environment  it  is  not 
surprising  that  many  of  our  finest  engineers  remain  bound  by  the  vision  of  the  current  project  and  are  not  concerned  about 
absorbing  new  information. 

How  large  is  the  problem?  Let  me  start  with  an  anecdote.  A  few  years  ago  the  AIAA  Board  of  Directors  felt  that  the  need 
for  the  US  aerospace  community  to  be  more  conscious  of  foreign  literature  had  reached  an  urgent  level.  Therefore,  we  began  to 
publish  a  low-priced  topical  abstract  bulletin  of  Soviet  and  Japanese  literature.  Because  sales  were  slow,  and  a  majority  of 
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orders  were  not  from  the  aeri.-’-ace  industry,  we  cancelled  the  program.  We  then  conducted  a  telephone  survey  of  AIAA 
members  to  udnerstand  why  interest  was  so  low.  The  response  came  back.  "Sure,  following  Soviet  and  Japanese  science  and 
technology  is  an  important  issue,  but  it  is  not  my  job”.  If  a  senior  materials  engineer  does  not  think  it  is  his  or  her  job  to  follow  the 
foreign  sci/tech  literature,  we  certainly  do  have  a  barrier.  ( 1 3) 

Some  comments  on  the  international  nature  of  science  and  technology  today,  based  on  data  from  the  latest  National 
Science  r  >undation  Science  Indicators  ( 1 4)  and  data  we  have  gathered  at  AIAA.  (15) 

—  R&  D  is  increasing  in  all  major  countries.  Although  this  is  true  in  absolute  dollars  for  all  countries,  the  rate  of  increase  is 
larger  outside  the  US.  In  fact  the  number  of  US  scientists  and  engineers,  altheiugh  recovered  after  a  drop,  is  about  the  same 
as  a  decade  ago. 

—  US  dominates  in  the  number  of  science  degrees.  USSR  and  Japan  dominate  in  the  number  of  engineering  degrees. 

—  US  lead  tn  publishing  and  patents  has  diminished.  In  earth  and  space  science  the  percent  of  US  technical  articles 
dropped  from  47%  to  42%  between  1 973  and  1 982.  In  engineering  and  technology  the  drop  was  from  42%  to  38%. 

Particular  to  aerospace: 

—  Published  literature  included  in  IAA  is  50%  non-US.  The  non-US  percent  is  similar  in  the  Chemical  Abstracts  and 
Inspec  databases. 

—  More  US  aircraft  patents  are  granted  to  foreign  inventors  than  to  US  inventors. 

We  concur  with  the  authors  of  the  NSF  study  that  a  "nation's  ability  to  use  the  results  of  scientific  and  technological 
activity  conditions  its  ability  to  succeed  in  international  political  and  economic  competition".  (16)  If  the  doing  of  science  is 
international,  how  well  do  we  do  in  using  the  results?  Not  as  well  as  we  might. 

True,  the  percent  of  references  in  US  articles  to  articles  from  other  countries  is  slowly  increasing.  In  1982  it  was  46%  in  all 
S&T  fields.  It  was  39%  in  earth  and  space  sciences,  and  44%  in  engineering  and  technology.  However  when  we  review  a  few 
key  aerospace  journals,  we  find  that  the  citation  rate  for  foreign  literature  is  considerably  less  than  warranted  based  on  the 
origin  of  aerospace  literature. 

In  the  AIAA  Journal  references  to  foreign  literature  have  remained  only  between  10—15%  for  25  years.  A  spot  check  of 
the  situation  for  two  European  aerospace  journals.  La  Recherche  Aerospatiale ,  and  Zeitschrift  fur  Flugwissenschaften  und 
Weltraumforschung  (ZFW).  shows  a  40—50%  non-European  citation  rate.  Yet  remember  that  50%  of  the  aerospace  literature 
is  non-US.  NATO  nations  are  not  making  full  use  of  this  wealth. 

TABLE  I 

Aerospace  Literature  Distribution  vs  Utilization 


Literature  Distribution  (%) 


US 

Europe 

USSR 

Other 

50 

24 

12 

14 

Literature  Use 

—  %  Citations 

US 

Europe 

USSR 

Other 

AIAA  J 

85 

1 1 

I 

3 

ZFW 

41 

52 

1 

6 

Aerospatiale 

41 

44 

1 

14 

The  percent  of  articles  in  major  national  journals  by  foreign  authors  is  another  measure  of  how  one  nation  pays  attention 
to  the  work  of  other  countries.  In  US  S&T  journals  38%  of  the  1982  articles  were  by  foreign  authors.  In  earth  and  space 
science,  25%  of  articles  were  by  foreign  authors,  and  in  engineering  and  technology,  32%. 

The  percent  of  articles  coauthored  by  scientists  and  engineers  from  organizations  in  different  nations  is  a  last  measure  of 
international  awareness.  The  NSF  data  measure  this  as  a  percent  of  all  institutionally  coauthored  articles.  In  1 982  this  was  1 7% 
in  all  fields,  31%  in  earth  and  space  sciences,  and  20%  in  science  and  technology.  Looking  closer  to  home,  we  identified  the 
percent  of  all  articles  that  were  coauthored  across  international  organizational  boundaries.  2%  of  AIAA  Journal  articles  were 
internationally  coauthored;  4%  of  ZFW  articles,  and  6%  of  La  Recherche  Aerospatiale  articles. 

These  data  demonstrate  that  aerospace  scientists  and  engineers  do  less  well  than  other  S&  E  in  using  the  results  of 
international  research.  We  do  have  a  noticable  problem  with  barriers  to  international  information  transfer. 

CONCLUSION 

The  barriers  to  international  information  transfer  must  come  down.  We  have  a  big  challenge  in  this  conference  to 
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determine  how  to  do  that.  Some  solutions  are  technological,  such  as  electronic  mail,  machine-aided  translation,  standard, 
satellite  transmitted  education  or  conferences.  Other  solutions  are  nontechnological,  including  education  in  languages,  in  the 
value  of  information,  and  in  the  management  and  use  of  information  resources.  At  AIAA  we  have  a  vision  of  the  universally 
connected  scientist/engineer  who  sits  at  a  workstation  which  is  connected  to  a  gateway.  The  gateway  supports  the  tools  and 
resources  to  make  those  technological  barriers  fall.  We  believe  that  in  the  process  of  overcoming  the  technological  barriers,  the 
personal  barriers  will  fall.  I  challenge  you  to  get  on  with  the  conference,  and  on  with  the  solutions.  The  seriousness  of  the 
situation  tells  us  that  there  is  no  time  to  waste. 
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RESUME 

Un  premier  chapitre,  dbcrivant  1 'importance  relative  des  langues  dans  le  monde  en  1987,  permet 
de  se  rendre  compte  que  les  habitants  du  globe  fonctionnent  avec  une  dizaine  de  langues,  qui  couvrent 
chacune  une  region  du  globe.  L'analyse  de  la  dbmographie  et  des  variations  des  bquilibres 
socio-bconomiques  permet  de  dbgager  la  tendance. 

L'usage  des  langues  est  btudib,  dans  le  deuxibme  chapitre,  pour  le  domaine  de  la  science  et  de 
la  technique,  en  fonction  du  type  de  la  communication  (bcrite  ou  orale)  et  du  contenu  (des  mathbmatiques 
aux  technologies)  et  les  tendances  sont  dbgagbes. 

Un  troisibme  chapitre  est  consacrb  aux  tentatives  qui  ont  btb  faites  pour  rbduire  la  barribre 
linguistique  -  comme  le  lancement  d'une  langue  vbhiculaire  unique  (esperanto,  anglais)  ou  le  dbveloppement 
de  l'btude  des  langues  btrangbres.  Les  problbmes  de  traduction,  de  linguistique  et  d ' ordinateurs  ne  sont 
pas  traitbs. 

L'analyse  dbtaillbe  de  la  barribre  culturelle,  du  quatribme  chapitre,  est  effectube  par  plusieurs 
approches  (sociologique,  psychosociologique,  psychanalytique  et  psycholinguistique)  afin  de  dbgager  les 
vbritables  enjeux  individuels  et  collectifs. 

Enfin,  un  dernier  chapitre  permet  de  dbgager  les  faits  porteurs  d'avenir,  comme  le  fonctionnement 
de  la  Commission  des  Communautbs  Europbennes  avec  ses  neuf  langues  officielles,  et  d'avancer  quelques 
pronostics  sur  1'btat  linguistique  du  monde  au  dbbut  du  21bme  sibcle. 


SUMMARY 

The  first  chapter,  which  describes  the  relative  importance  of  languages  in  the  world  in  1987; 
shows  that  the  worlds  population  effectively  makes  use  of  about  ten  languages,  each  of  which  covers  a 
different  area  of  the  globe.  The  trends  are  revealed  by  an  analysis  of  population  density  and  socio-economic 
balances. 

The  use  of  .anguages  in  the  field  of  Science  and  Technology  is  examined  in  the  second  chapter, 
with  respect  to  the  type  of  communication,  either  written  or  spoken,  and  the  content,  from  mathematics 
to  technology,  and  the  trends  are  shown. 

The  third  chapter  is  devoted  to  the  attempts  which  have  been  made  to  overcome  the  linguistic 
barrier,  such  as  the  introduction  of  a  single  common  language  (Esperanto  or  English)  or  the  development 
of  the  study  of  foreign  languages.  The  problems  of  translation,  linguistics  and  the  computer,  are  not 
dealt  with. 

The  detailed  analysis  of  the  cultural  barrier  contained  in  the  fourth  chapter  is  achieved  by 
a  combination  of  several  different  approaches  (sociological,  psychosociological ,  psychoanalytic  and 
psycholinguistic)  and  reveals  just  what  is  at  stake,  both  Individually  and  collectively. 

The  final  chapter  concentrates  on  future  trends,  such  as  the  working  of  the  E.E.C.  with  its  nine 
official  languages,  and  makes  several  predictions  as  to  the  linguistic  state  of  the  world  at  the  start 
of  the  twenty  first  century. 

★ 

★  * 


S0HUA7RE 


C  0  H  T  E  H  T  S 


1  -  L' importance  relative  des  langues  dans  le  monde  en  1987. 

1.1  -  La  situation  actuelle. 

1.2  -  La  tendance. 

2  -  L'usage  des  langues  en  1987  dans  le  domaine  des  sciences  et  techniques. 

2.1  -  La  ndcessite  de  la  communication  scientifique  internationale. 

2.2  -  Le  type  de  la  communication  :  orale,  ecrite. 

2.3  -  Le  type  de  contenu. 

2.4  -  La  tendance. 

3  -  Les  tentatives  de  resolution  de  la  barribre  linguistique. 

3.1  -  L'apprentissage  des  langues  etrangbres. 

3.2  -  Line  langue  internationale  unique. 

3.3  -  Une  langue  internationale  unique  pour  la  communication  scientifique  et  technique. 

4  -  La  barribre  culturelle. 

4.1  -  Les  diffbrentes  approches  : 

.  sociologique, 

.  psychosociologique, 

.  psychanalytique, 

.  psycholinguistique. 

4.2  -  Les  enjeux. 

5  -  Les  faits  porteurs  d'avenir  sur  l'etat  linguistique  du  monde  au  debut  du  21bme  sibcle. 


★  * 

/  -  L'  IUPOKTAUCE  RELATIVE  VES  LANGUES  VMS  LE  UONVE  EN  1917. 

1.1  -  La  situation  actuelle  : 

Depuis  1 'apparition  du  langage,  les  habitants  du  globe  ont  toujours  communique  entre  eux  par 
des  langues  locales,  voire  des  dialectes,  qui  n'btaient  reellement  comprehensibles  que  par  un  nombre 
relativement  faible  de  personnes  constituant  un  groupe  regional  culturel. 

Le  vingtibme  sibcle  a  vu  la  conjugaison  de  plusieurs  phenombnes  :  explosion  de  la  demographie, 
allongement  de  1 'espbrance  de  vie,  accroi ssement  du  commerce  et  dbveloppement  de  la  technologie,  qui  ont 
conduit  les  cinq  milliards  d'hommes  vivant  sur  la  terre  en  1987,  &  communiquer  entre  eux  &  plusieurs 
niveaux. 


Au  niveau  local,  la  langue  -  ou  le  dialecte  -  est  toujours  1'outil  de  communication  du  groupe 
culturel  d'une  region,  avec  cependant  des  caracteri stiques  nouvelles  :  le  nombre  d'habitants  de  certaines 
regions  s'est  beaucoup  accru  au  cours  du  vingtibme  sibcle  (Algerie,  Egypte,  Indonesie,  Pakistan...)  et 
la  bijectivite  frequente  entre  une  langue  culturelle  et  un  etat  a  conduit  b  un  renforcement  de 
1 'appartenance  h  ce  groupe  culturel. 

Au  niveau  global,  les  echanges  en  tout  genre  qui  se  developpent  depuis  quelques  dizaines  d'annees 
sur  l'ensemble  du  monde  ont  ete,  de  fait,  precedes  pendant  deux  ou  trois  sibcles  d'echanges  impcrtants 
mais  limites  b  ce  que  1'on  pourrait  appeler  les  "grandes  regions"  du  globe.  Et  comme  il  a  bien  fallu  se 
comprendre  pour  commercer  ou  se  battre,  on  constate  qu'aujourd ' hui  onze  langues  sont,  de  fait,  devenues 
"vehiculaires"  pour  1'une  de  ces  regions  : 


-  Mandarin 

-  Indi 

-  Anglais 

-  Espagnol 

-  Arabe 

-  Malais 

-  Russe 

-  Frangais 

-  Swahili 

-  Japonais 

-  Portugais 


Chine 

Inde 

Amerique  Nord  et  Europe  Occidentale 

Amerique  Sud  et  Centre 

Afrique  Nord  et  Moyen-Orient 

Asie  du  Sud-Est 

Europe  de  l'Est 

Europe  Occidentale  et  Afrique 

Afrique  de  1 'Est 

Japon 

Amerique  du  Sud 


1.2  -  La  tendance  : 


L'ensemble  des  grands  equilibres  demographiques,  socioeconomiques  et  militaires  ont  toujours 
ete  determinants  pour  1 'importance  d'une  langue,  aussi  est-il  difficile  d'augurer  de  l'avenir.  Cependant, 
si  l'on  exclut,  pour  le  sibcle  prochain,  l'hypothbse  d'un  grand  cataclysme  (biologique,  militalre, 
geologique  ou  cosmique),  le  facteur  demographique  devient  preponderant  et  fait  apparaltre  des  modifications 
profondes  dans  1' importance  des  grandes  langues  du  globe. 


Alors  les  falts  les  plus  marquants  pour  les  cinquante  ans  &  venir  sont  : 

-  la  stabilisation  de  certaines  langues  :  mandarin,  indi,  frangais,  russe,  japonais, 

-  la  croissance  du  swahili,  malais  et  portugais, 

-  la  croissance  rapide  de  1'espagnol 

et  la  croissance  encore  plus  rapide  de  l'arabe  qui  pourrait  ddpasser  le  mandarin  au  milieu  du  sifecle 
prochain. 


£n  quelque  sorte  la  part  des  langues  actuellement  trds  Importantes  en  nombre,  comme  le  mandarin, 
l'indi  et  l'anglais  va  diminuer  au  profit  essentiellement  de  1'espagnol  et  surtout  de  l'arabe. 

Par  ailleurs,  les  choix  stratdgiques  politiques,  comme  la  decision,  en  1928,  de  Mustapha  Kemal 
ATATURK  d'lmposer  l'alphabet  romain  en  Turquie  et  la  decision  de  basculer  sur  l'alphabet  cyrillique  pour 
les  esquimaux  d'U.R.S.S.,  sont  b  la  fois  particulibrement  importants  au  plan  de  1'exemplaritd  et  de  faible 
Importance  sur  le  plan  quantitatif  de  1'dvolutlon  des  langues  pour  1 'ensemble  du  globe. 

t  -  L' USAGE  DCS  UNGUES  EM  19t7  PAMS  LE  POUAM  VES  SCIENCES  ET  TECHNIQUES. 


2.1  -  Mdcessitd  de  la  cowwinication  scientifique  Internationale  : 

Pour  un  pays,  les  grands  objectifs  culturels  de  Vinformation  scientifique  et  technique  ont  dtd 
synthdtisds  dans  une  circulaire  (6)  du  ministre  de  la  recherche  frangais  en  septembre  1981  : 

-  exprimer  1'identitd  scientifico-culturelle  d'un  peuple, 

-  assurer  1'accbs  de  ce  peuple  b  la  modernity,  dans  sa  propre  langue, 

-  assurer  ddmocratiquement  l'dgalitd  de  l'accbs  b  la  connaissance  par  le  passage  de  1' information 

scientifique  et  technique  dans  1 'ensemble  du  tissu  social  d'une  nation. 

Ce  troisibme  point  n'dtant  que  la  reprise  d'un  concept  du  philosophe  ALAIN  qui  ddveloppait  l'idde 

que  "le  niveau  de  culture  d'un  pays  n'dtait  pas  ddfini  par  celui  de  Platon  mais  bien  par  celui  de  sa 

servante". 

Pour  un  organisme  de  recherche,  cette  communication  Internationale  est  aussi  ndcessaire  pour 
la  defense  de  la  quality  de  1 'organisme  (son  image)  ou  plus  simplement  pour  la  justification  de 

1 'utilisation  des  fonds  allouds.  Nous  savons  tous  que  les  grands  congrfes  pdriodiques  internationaux  sont 
en  fait  1 'occasion  donnde  aux  organismes  de  recherche  de  rendre  compte  de  leurs  travaux  devant  l'opinion 
pub li que. 


Au  niveau  de  1'individu  enfin,  la  publication  reste  toujours  ndcessaire,  malgrd  la  rdalitd  du 
"college  invisible"  constitud  par  les  personnes  qui  sont  concerndes  par  un  meme  axe  de  recherche,  car 
le  probldme  d 'antdriori  td  se  pose  souvent,  de  mfiine  que,  tout  simplement,  la  consolidation  du  prestige 
personnel  du  chercheur  ;  il  y  a  ddjb  plusieurs  dizaines  d’anndes,  en  effet,  que  le  dilemme  publish  or 
perish  est  bien  ancrd  dans  la  tete  de  tous  les  scientifiques. 

En  tout  dtat  de  cause,  c'est  bien  en  science  et  en  technologie  que  les  investissements 
intellectuels,  technologiques  et  financiers  sont  les  plus  grands  et  done,  globalement,  c'est  bien  lb  que 
la  ndcessitd  de  la  meilleure  information  est  la  plus  impdrieuse.  Par  ailleurs,  un  rapport  du  D.O.E.  (7) 
sur  1'impact  de  1 ' Information  Scientifique  et  Technique  montre  que  1 '  investissement  en  I.S.T.  est 
parti culihrement  rentable  pour  la  recherche. 

2.2  -  Le  type  de  la  co—inication  :  orale,  dcrite  : 

Toutes  les  analyses  (1,  2,  3,  4,  5,  6)  font  apparaitre  que  l'anglais  est,  de  fait,  la  langue 
vdhiculaire  la  plus  pratique  pour  la  majority  des  scientifiques. 

C’est  actuellement  la  langue  la  plus  rdpandue  quantitativement  parmi  les  scientifiques  et 
techniciens  du  monde  entier,  et  elle  est,  de  plus,  situde  dans  le  groupe  des  langues  qui,  sur  le  plan 
de  la  qualitd  des  publications  sont  les  plus  intdressantes. 

L'anglais  dtant  done  actuellement  la  langue  vdhiculaire  de  la  communication  scientifique,  ceci 
amdne  une  diffdrence  fondamentale  entre  le  scientifique  de  langue  anglaise  et  les  autres. 

En  ce  qui  concerne  la  coMwnication  orale  (3,  4),  et  particulibrement  pendant  les  meetings,  congrbs 
ou  confdrences,  les  anglophones  ont  un  dnorme  avantage  naturel  qui  fait  des  non  anglophones  les  grands 
perdants  de  la  communication  b  tous  les  niveaux.  Au  niveau  du  message  bien  sOr,  mais  aussi  et  surtout 
b  celui  de  la  relation  affective  :  la  perte,  dirait  MAC  LUHAN,  est  double,  b  la  fois  sur  le  "message" 
et  sur  le  "massage". 

Ce  phdnombne  est  encore  amplifid  par  le  fait  que  beaucoup  de  scientifiques  anglais  ou  amdricains 
ne  comprennent  que  l'anglais  et  qu'ils  imposent  en  quelque  sorte,  b  1 'ensemble  d'un  groupe,  le  choix  de 
la  langue  anglaise  oti  11s  sont  bien  entendu  le  plus  b  1'aise.  Nous  avons  tous  en  mdmoire  des  situations 
ou  un  groupe  dlscutant  en  allemand,  frangais  ou  espagnol  bascule  sur  l'anglais  b  1'arrivde  d'une  seule 
personne,  au  grand  ddsavantage  de  tous  les  autres*. 


*  Je  me  souviens  d'une  rdunion  de  ndgociation  b  Bruxelles  rdunissant  9  personnes  de  6  pays  :  8  originaires 
de  5  pays  parlant  couramment  le  frangais  et  une  amdricaine  ne  parlant  que  l'anglais. 
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En  ce  qui  concerne  la  coMinlcatlon  gcrite  {3,  4),  la  situation  est  inversge  et  tourne  &  l’avantage 
des  non  anglophones. 

En  effet  (3),  12  *  des  anglophones  peuvent  lire  le  frangais  et  4  %  l'allemand,  alors  que  97  % 
des  francoohones  et  des  germanophones  peuvent  lire  1'annlais.  Ceci  permet  done  aux  chercheurs  non 
anglophones  d'avoir  accfes  &  la  literature  anglophone  qui  repr£sente  50  %  de  la  production  des  publications. 

Ce  r^sultat  est  corrobord  par  l'analyse  du  “Science  Citation  Index"  qui  raontre  que  les  Auteurs 
anglophones  citent  ^  90  *  des  documents  anglophones  (alors  que  la  production  est  de  50  X)  et  I'gventuelle 
difference  de  quality  ne  peut  pas  expliquer  une  si  grande  difference,  d'autant  plus  que  deux  phenomfenes 
jouent  en  sens  Inverse  :  le  fait  que  beaucoup  des  grandes  revues  internationales  sont  en  langue  anglaise 
est  compense  par  le  fait  que  les  documents  achetes  par  les  bibl iothfcques  en  langue  autre  que  l'anglais 
sont  beaucoup  mieux  seiectionnes  (en  raison  de  la  difficult^  d'accbs)  done,  dans  doute,  de  meilleure 
quality,  et  done  plus  important  4  lire. 

Un  point  asset  remarquable  de  la  communication  gcrite  qui  ne  me  paralt  pas  avoir  £t£  assez  signal^ 
est  que  certains  sp^cialistes  du  Comity  de  lecture  d'une  grande  revue  internationale  sont  inform^s  du 
contenu  d‘un  article  six  mois  avant  les  autres  lecteurs.  Tout  se  passe  comrne  si  l'information  allait  aux 
mieux  Inform^s. 

2.3  -  Le  type  de  contenu  : 

A  l'1nt£rieur  des  sciences  et  des  techniques,  la  situation  linguistique  n'est  pas  la  mSme  d'un 
domaine  &  l'autre. 

Si  nous  prenons  1 'exemple  des  scientifiques  frangais  (3),  nous  constatons  : 

-  que  dans  les  domaines  des  math&natiques,  de  la  physique  et  de  la  chimie,  plus  de  la  moitig  des  articles 
d'auteurs  frangais  sont  r&llg^s  en  anglais  et  que  ce  pourcentage  n'a  fait  que  croltre  pendant  les 
derniferes  annles  ; 

-  que  dans  les  domaines  des  "sciences  appliqu^es"  (biologie,  sciences  de  la  terre,  m£decine  et  technologie) , 
un  article  sur  trols  est  rddig£  en  anglais,  et  que  la  croissance  du  nombre  des  publications  en  langue 
anglaise  semble  s'?tre  arret^e. 

Par  ailleurs,  nous  constatons  que  la  part  de  la  langue  japonaise  crolt  rapidement,  particul iferement 
dans  la  technologie.  Un  rapport  de  1986  (8)  estime  Si  600  000  le  nombre  de  publications 
scientifico-technologiques  par  an,  dont  moins  de  3  000  font  l'objet  d'une  traduction,  mSme  partielle. 

Ces  deux  exemples  autorisent  le  constat  suivant  les  publications  en  sciences  fondamentales 

s'orientent  vers  l'^criture  en  anglais,  celles  des  sciences  appliqu^es  sont  essentiel lement  £crites  en 
langue  nationale. 

2.4  -  La  tendance  : 

Comrne  l'information  est  plus  vitale  dans  le  domaine  de  la  science  et  de  la  technologie  que  dans 
les  autres  domaines,  1 'abandon  pur  et  simple  du  document  en  langue  £trangbre  est  moins  frequent,  mais 
cependant  encore  alarmant.  L'enquete  sur  les  scientifiques  anglais  (2)  a  montr£  qu'en  1979,  29  %  ne 
faisaient  rien  devant  une  rgfgrence  frangaise,  alors  que  84  X  d'entre  eux  d^claraient  se  d^brouiller  en 
frangais  (ce  pourcentage  d’inaction  totale  montait  &  46  X  dans  le  domaine  des  sciences  sociales). 

Par  ailleurs,  l'analyse  des  500  000  rapports  sur  l'gnergie  detenus  S  Saclay,  provenant  de  45 
pays,  montre  qu'ils  ont  £t4  collects  pendant  trente  ans,  parce  que  r£dig#s  en  langage  dit  "accessible", 
c'est-S-dire  essentiel lement  en  alphabet  roman  et  cyrillique. 

La  r&gle  semblait  done  souvent  1'abandon  pur  et  simple  du  document  "diffici lement  accessible". 

Or,  nous  savons  maintenant,  gr3ce  Si  l’enquete  (7)  portant  sur  60  000  ing^nieurs-chercheurs 
subventionn^s  par  I'US.OOE  qu'un  "chercheur"  lit  en  moyenne  un  document  par  jour  de  travail  (220/an). 

Ce  chiffre  de  "un”  document  lu  par  jour  et  par  individu  a  peu  de  chance  de  croltre,  alors  que 
le  nombre  de  publications  annuel les  augmente  (8). 

Le  systfeme  £tant  manifestement  en  d£sgquilibre  croissant,  la  solution  de  Tabandon  de  documents 
"difficiles  d'acc&s"  paralt  de  plus  en  plus  risquSe  et  des  adaptations  seront  done  ingluctables. 

On  volt  d'ailleurs  apparattre  ici  que  la  quantity  de  documents  sera  en  soi  une  grande  difficult^, 
m&ne  si  tous  les  documents  £taient  Merits  dans  la  mSme  langue.  II  est  probable  que  ce  problfeme  sera  abord£ 
par  les  deux  bouts.  Si  la  lecture  :  s^lectlonner  mieux  les  documents  ;  S  1'gcriture  :  6crire  plus  court 
et  moins  souvent*,  puisqu'en  cette  matiferc  les  auteurs  et  les  lecteurs  sont  les  m&nes  personnes. 


*  Un  rapport  du  NTIS  est  lu  en  moyenne  9  fois,  alors  que  certains  (best  sellers)  sont  vendus  50  000  fois... 
II  y  en  a  done  quelques  uns  qui  ont  £td  dcrits  pour  ...  personne  ! 
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3  -  les  mmavKS  ve  kesoumoh  ve  la  baksieke  LJteuisnm. 

3.1  -  L'apprentlssage  des  langues  btranqbres  : 

Les  grands  bquilibres  socio-bconomlques,  politiques,  religieux  et  militalres  ont  toujours  dbterminb 
le  choix  de  I'btude  d'une  seconde  langue. 

Alnsi,  pendant  longtemns,  pour  btre  au  courant  de  ce  qul  changeait  dans  le  monde,  une  seule  langue 
sufflsait  :  le  latln  jusqu'au  16>me  sibcle  et  le  frangais  jusqu'au  19bme. 

Ensuite,  et  jusqu'en  195u,  11  sufflsait  d'avolr  la  connalssance  de  trois  langues  romanes  (frangais, 
allemand,  anglais)  pour  btre  informb  de  la  science  et  de  la  technologle,  c'btait  done  encore  possible. 

Depuls,  11  a  fallu  ajouter  une  langue  cyrllllque  :  le  russe,  et  aujourd'hul,  une  langue 
Idbographique,  le  japonals. 

Oemaln ,  11  va  falloir  ajouter  une  autre  langue  idbographique,  le  chlnols,  et  deux  autres  langues 
romanes,  l'espagnol  et  le  portugals  i  et  aprbs  demaln,  une  langue  arablque. 

En  consequence,  l'apprentlssage  de  toutes  les  langues  btrangbres  nbcessaires  b  la  comnunl cation 
sclentlfique  n'est  pas  la  bonne  vole,  d'autant  plus  que  du  c6t b  de  l'offre,  la  situation  s'est  beaucoup 
dbtbriorbe  pulsque  seuls  1'anglals  et  le  frangais  contlnuent  b  btre  enseignbs  dans  la  plupart  des  pa ys 
du  monde,  ce  qul  n'est  pas  de  cas  du  russe,  du  chlnols,  du  japonals  et  de  l'arabe  et  plus  le  cas  (6)  de 
1 'allemand  et  de  l'espagnol. 

Deux  autres  solutions  seralent  done  que  tout  le  monde  parle  la  mbme  langue  ou  que  tous  les 
sclentiflques  apprennent  une  mbme  “seconde  langue  sclentiflque"  codbe  come  1'algbbre,  et  precise  comme 
les  systbmes  d' uni  tbs. 

3.2  -  Une  langue  Internationale  unique. 

La  solution  la  plus  radlcale  serait  blen  sftr  la  crbatlon  d'une  langue  Internationale  qul 

bliminerait  dbfinitlvement  la  barribre  linguistique  et  e'est  pour  attelndre  cet  objectif  que  l'esperanto 
a  btb  erbb.  Bien  que  l'opbration  “esperanto"  solt  Importante*  (5),  elle  est  cependant  restbe  b  un  stade 
de  dbveloppement  trop  falble  pour  que  1 ' Investlssement  nbcessaire  b  l'apprentlssage  de  la  langue  solt 
rentable  pour  1 ' 1 ndlvldu.  II  faut  dire  que  le  dbveloppement  de  l'esperanto  s'est  trouvb  concomitant  de 
1 'explosion  dbmographlque  et  pendant  que  quelques  millions  d'houmes  apprenalent  l'esperanto,  quelques 
centaines  de  millions  de  chinois  apprenalent  “naturellement"  le  mandarin. 

Le  problbme  de  la  langue  unique  se  pose  un  peu  comme  pour  le  tblbphone  ou  la  messagerie 
blectronique  :  le  grand  nombre  d'abonnbs  joue  un  rble  de  premlbre  importance,  la  qualitb  est  un  peu  le 

sous-prodult  de  la  quantity.  La  langue  unique  universelle  reste  done  fortement  Improbable  dans  un  avenir 

proche,  car  elle  nbcessiterait  1 'apprentissage  slmultanb  de  cette  langue  par  deux  ou  trois  milliards 

d' hommes. 


3.3  -  Une  langue  Internationale  pour  la  co— unication  sclentlfique  et  technique. 

L'anglais  est  la  langue  maternelle  de  250  millions  d'hommes  et  la  deuxibme  langue  pour  bgalement 
250  millions  d'hommes,  mals  ces  dernlers  ne  sont  pas  dlstrlbubs  au  hasard  parmi  les  activitbs  des  humains. 
II s  sont  concentrbs  dans  des  domalnes  d'activitb  b  haute  densitb  de  communication  :  essentiellement  le 
commerce  ou  70  *  du  courrier  international  -  surtout  commercial  -  est  en  anglais,  le  domaine  de  la  radio 
et  de  la  tblbvision,  le  domaine  des  transports  (aviation  et  marine)  et  le  domaine  de  la  recherche  et  du 
dbveloppement  en  science  et  en  technologle. 

Aujourd'hul  75  X  des  btudiants  d'bcoles  secondalres  du  monde  non  anglais  (b  1 'exception  de  la 
Chine)  apprennent  l'anglais. 

L'anglais  peut  done  Stre  considdrd  comme  la  langue  vbhiculaire  de  la  communication  scientifique 
et  technique  et,  particulibrement  dans  le  domaine  sclentlfique  oil  50  X  de  la  littbrature  est  en  anglais, 
on  peut  consldbrer  que  la  barribre  linguistique  est  dbpassbe,  mbme  si  ce  n'est  peut-btre  que  provisoire. 

En  effet,  la  quantltb  de  sclentiflques  qui  parlent  vraiment  l'anglais  risque  de  se  stabiliser 
assez  rapidement  car  le  nombre  des  btudiants  qui  iront  dans  des  universitbs  de  langue  an glaise  (pour  des 
btudes  ou  des  recherches)  ne  pouvant  croitre  Indbflniment,  la  proportion  de  ces  rbels  bilingues  va 
dberottre. 

4  -  LA,  8AKKIEEE  CULTURE  LLE. 

4.1  -  Les  diffbrentes  approches. 

Sans  chercher  b  dbfinir  la  culture,  11  est  certain  que  la  combinaison  des  aspects  gbographiques 
(terre,  mer,  plaine,  montagne...)  et  des  aspects  climatiques  (tempbrature,  hygrombtrie. . . )  a  btb  primordiale 
pour  la  constitution  initiate  des  "groupes  culturels"  de  notre  espbee. 

Ensuite,  1 'apparition  d'une  langue  commune  b  un  groupe  a  permis  de  transmettre  la  culture  du 
groupe,  mals  aussi  de  structurer  la  socibtb  et  d'en  assurer  alnsi  la  pbrennitb. 


*  Avec  100  journaux  et  20  000  publications  par  an,  l'esperanto  compte  quelques  millions  de  pratiquants 
(1  b  8)  et  rassemble  b  son  congrbs  annuel  4  500  personnes  provenant  de  50  pays. 


2-6 


La  parole  a  amen£  une  nouvelle  repartition  du  pouvoir  dans  la  socidte  et  1 'apparition  de  1  ecriture 
a  conduit  de  nouveau  &  une  nouvelle  redistribution  de  ce  pouvoir.  -  Le  pharaon  de  1  epoque  avait  bien 
raison  de  se  mettre  en  colfere  et  de  s'arracher  les  cheveux  lorsqu'on  lui  a  presente  cette  invention. 

Le  developpement  technologique  du  vingtifeme  sifecle  a  eu  un  impact  si  considerable  sur  la  vie 
culturelle  des  societes  que  l'on  n'en  mesure  peut-etre  pas  encore  la  portae.  Si  certaines  avancees 
technologlques  sont  bien  connues,  d'autres  sont  peut-etre  aussi  importantes  sociologiquement  :  deyerser 
du  chi  ore  dans  les  rivieres  est  peut-Stre  un  facteur  aussi  important  que  la  penicilline  pour  lesperance 
de  vie  ;  les  religions  sont  conduites  &  prendre  en  compte  la  synthfese  des  elements  par  les  etoiles  et 
1 'expansion  de  l'univers,  et  le  nylon,  par  1 'intermediaire  du  collant  qui  a  permis  la  mini-jupe,  a  sans 
doute  participe  de  la  liberation  de  la  femne  dans  certaines  societes. 

Psychologiquement,  la  fonction  de  la  langue  n'est  pas  uniquement  de  satisfaire  le  besoin  de 
reconnaissance  d'un  invididu,  mais  aussi  son  besoin  de  consideration.  En  fait  la  langue  renforce  le 
sentiment  d'appartenance  d'un  individu  Si  un  groupe  et  inversement  le  besoin  de  connivence  d'un  groupe 
-  pour  se  situer  par  rapport  Si  d'autres  groupes  -  developpe  la  mobilite  de  la  langue  vers  un  dialecte 
ou  m8me  un  argot  qui,  lui-meme,  change  quand  le  groupe  s'apergoit  qu'il  diffuse  dans  les  autres  groupes. 

Dfes  que  l'approche  de  la  barrifere  culturelle  est  un  peu  psychanal itique,  on  s'apergoit  du  role 
preponderant  joue  par  1 1 inconscient.  Oeux  enqu^tes  americaines  rapportees  dans  "The  foreign  Language 
Barrier"  (5)  montrent  que  les  deux  facteurs  determinants  pour  exploiter  la  literature  soyidtique  etaient 
la  connaissance  familifere  de  I’U.R.S.S.  et  la  bonne  opinion  de  la  recherche  en  Union  Sovietique,  et  ceux 
qui  n'ont  pas  une  bonne  opinion  de  cette  recherche  ne  citent  jamais  un  auteur  sovietique. 

Sur  le  plan  linguistique  pur,  si  1 ‘importance  des  paradigmes  est  facile  Si  mettre  en  evidence 
(l'arabe  possfede  plus  d'une  centaine  de  mots  pour  designer  le  chameau  alors  qu'il  n'y  en  a  aucun  pour 
designer  la  table  qui  etait  un  objet  inconnu),  1‘axe  syntagmatique  est  tout  aussi  important  car  il  structure 
la  pensee,  parfois  de  faqon  exceptionnelle.  -  Certaines  langues  n'ayant  pas  la  forme  du  futur  (ou  du  passe), 
1' impact  sur  la  pensee,  la  religion  et  la  philosophie  est  considerable.  - 

Le  cerveau  de  1'Stre  humain  qui,  une  fois  adulte,  fonctionne  essentiel lement  par  associations 
et  par  images  (fantasmes)  se  developpe  dans  la  toute  petite  enfance  Si  la  fois  par  le  langage  et  par  la 
pensee  ;  c'est  pendant  cette  phase  de  croissance  que  la  pensee  structure  le  langage  mais  aussi  que  le 
langage  structure  la  pensee.  Le  resultat,  bien  sQr  culturel,  influence  les  approches  scientifiques  et 
technologiques.  -  La  conception  structurelle  de  certains  ordinateurs  japonais  est  impregnee  de  culture 
japonaise. 

4.2  -  Les  enjeux. 

4.2.1  -  L'enjeu  culturel . 

A  chaque  innovation  technique,  la  quantity  d‘ informations  dont  il  a  fallu  disposer  pour  r^aliser 
un  objet  s'est  accrue.  Par  exemple,  le  coOt  de  realisation  du  film  TRON  -  ou  de  l'ordinateur  LISA  -  est 
de  50  M.  $,  soit  ZOO  hommes  X  annges. 

Les  investissements  immatgriels  (intellectuels)  vont  devenir  de  plus  en  plus  grands  pour  fabriquer 

les  nouveaux  produits,  conroe  si  la  society  de  production  allait  se  transformer  en  society  de  creation. 

La  grande  technologie  du  21feme  sifccle.sera  celle  de  1 ' information  sous  toutes  ses  formes. 

Chaque  innovation  technique  a  toujours,  plus  ou  moins  rapidement,  modeie  les  rapports  de  forces 
entre  les  nations  ainsi  que  les  moeurs  de  ces  m@mes  nations.  Ce  qui  fut  particul iferement  vrai  pendant 
longtemps  pour  les  armes,  puis  pour  les  biens  de  consommation,  le  devient  maintenant  pour  les  "biens  de 
communication". 

Il  est  symbol ique  que  les  deux  premiers  plus  gros  ordinateurs  au  monde  (deux  CRAY  XMP)  aient 
dtd  achetls,  1'un  par  le  D.O.D.  (Department  de  la  Defense  des  U.S.A.),  et  l'autre  par  une  petite  entreprise 
de  spectacle  amgricaine  ("Digital  Production"),  d'une  trentaine  de  personnes,  specialises  dans  la  synthfese 

des  images.  Comme  si  le  pouvoir  et  la  predominance  des  etats  se  jouaient  sur  les  deux  fronts,  celui  de 

la  guerre  passive  :  les  armes,  et  celui  de  la  guerre  active  :  la  culture. 

4.2.2  -  L'enjeu  de  la  langue. 

Il  ne  faut  pas  rSver,  une  langue  n'est  ni  neutre,  ni  innocente  ;  elle  constitue,  dans  cette 
nouvelle  forme  de  guerre  economico-culturelle,  1'un  des  enjeux  les  plus  importants. 

C'est  sans  doute  pour  cela  que  la  langue  comnune  i  1 'ensemble  des  habitants  du  globe  a  peu  de 
chances  de  r^ussite  dans  un  avenir  proche. 

De  plus,  cette  solution  unique  n'est  sans  doute  pas  souhaitable  car  elle  condulrait  i  trois 

risques  : 

-  pauvretg  culturelle  generalise  par  uniformisation, 

-  structuration  unique  de  la  forme  de  pensee, 

-  deliquescence  de  la  langue  cholsie. 

En  outre,  cela  ne  conduirait  pas  forc&nent  ni  &  la  paix  mondiale  (actuellement  la  plupart  des 

bell1g£rants  dans  le  monde  parlent  la  m@me  langue),  ni  mSme  %  une  meilleure  efflcacite  dans  le  traltement 

des  Informations  (le  fonctionnement  de  la  communication  n’est  pas  plus  performant  dans  les  grandes 

entreprises  multinationales). 


5  -  L ES  FAITS  rmtt&S  V/WOin  SUH  L'ETAT  LJUGUISHOUE  VU  HONK  AU  VEBUT  VU  ilimt  S1ECLE. 

L'ytat  linguistique  du  monde  au  21feme  sifecle  sera  probablement  le  suivant  : 

-  au  niveau  du  “groupe  d'appartenance",  une  langue  “nationale"  riche,  socio-culturelle,  porteuse  de 
1 'Identity  d'un  peuple.  Ces  langues  assurant  les  relations  profondes  entre  les  huroains  resteront  trfes 
nombreuses  i 

-  au  niveau  regional,  une  petite  dizalne  -  peut-Stre  une  derai-douzaine  -  de  langues  simples  permettront 
la  comunlcatlon  dans  chaque  grande  region  du  globe  ; 

-  au  niveau  mondial,  une  ou  deux  langues  vdhiculaires,  surcodSes  mais  pauvres,  permettront  d'assurer  la 
conmunicatlon  "yconomique"  globale. 

Tout  d'abord,  quelques  remarques  s'imposent  : 

-  pour  faire  aussi  partie  d'un  autre  groupe  d'appartenance,  il  faudra  apprendre  ryellement  la  langue  de 
ce  groupe  ; 

-  pour  avoir  une  action  international,  chacun  devra  faire  1 'effort  d'apprendre  la  langue  de  la  region 
et  la  langue  Internationale  ; 

-  c'est  toujours  le  besoin  -  ou  l'envie  -  qul  permet  de  sauter  la  barribre  linguistique  :  la  base  PASCAL 
du  C.N.R.S.  est  plus  utilisde  par  le  Japon  que  par  les  autres  pays  europSens  qui  utilisent  pourtant 
des  langues  romanes  ; 

-  la  communication  dans  les  sciences  appliqudes  -  la  "technologie"  -  se  fera  de  plus  en  plus  dans  la  langue 
nationale  d'appartenance  (8)  parce  qu'elle  va  concerner  la  totality  de  la  population  du  groupe,  et  peut- 
Stre  aussi,  dans  certains  cas,  pour  des  raisons  de  confidentiality  industrielle. 

En  attendant  1'outil  miraculeux  qui  permettra  de  s'exprimer  vocalement  dans  s  a  langue 
d'appartenance,  alors  que  1 ' interlocuteur  entendra  directement  dans  la  sienne  -  et  ryciproquement  -  un 
certain  nombre  de  choses  peuvent  Stre  tentSes  pour  rdduire  la  barriSre  linguistique  : 

-  en  ce  qui  concerne  la  communication  dcrite,  une  voie  pourrait  Stre  la  traduction  systSmatique,  et  en 
totality,  des  revues  en  une  dizaine  de  langues  ;  le  premie1"  pas  vers  cette  traduction  systdmatique 
pourrait  concerner  la  traduction  des  rSsumSs  des  revues  seconda ires  afi n  qu'un  document  russe  ou  chinois 
puisse  etre  retrouvd  automatlquement,  quelle  que  soit  la  langue  dans  laquelle  le  rdsumd  a  dtd  traduit  ; 

-  en  ce  qui  concerne  la  communication  orale,  le  premier  objectif  l  atteindre  est  1 'accoutumance  de  l'homme 
au  multilinguisme  (au  moins  deux),  avec  habitude  de  s'en  servir,  ceci  dtant  particullferement  important 
pour  les  anglophones  de  langue  maternelle  anglaise.  L’une  des  solutions  les  plus  habiles  (6)  serait 
peut-etre  d'utiliser  systymatiquement  deux  langues  pour  toute  intervention  :  la  langue  d'appartenance 
pour  1 'exposd  et  la  langue  rdgionale  (ou  Internationale)  pour  les  informations  ycrites  projetyes  ;  ou 
bien  sQr  l'inverse,  suivant  les  circonstances  et  les  possibilitSs  linguistiques  de  1' intervenant. 

Un  second  objectif  S  atteindre  serait  peut-etre  la  prise  de  conscience  de  la  duality 
purety/richesse,  en  effet,  tous  les  "dyfenseurs"  d'une  langue  "nationale”  ont  tendance  h  travailler  "en 
dyfense",  essentiellement  en  dyfense  de  la  purety  de  la  langue,  au  dytriment  de  sa  richesse  et  de  sa  vie. 
Une  expyrlence,  comme  ce  que  les  sud-amyricains  appellent  le  "portugnol"  (mSlange  d'espagnol  et  de  portugais 
et  non  alternance  de  phrases  dans  les  deux  langues)  me  paralt  intyressante  Si  suivre. 

Une  autre  expyrlence  &  suivre  est,  yvidemment,  celle  des  organismes  internationaux. 

Ils  fonctionnent  gynyralement  avec  un  petit  nombre  de  langues  officielles,  deux  pour  l'Agence 
International  de  l'Energie  (A.I.E.),  quatre  pour  l'Agence  Internationale  pour  l'Energie  Atomique 
(A.I.E.A.),  etc...,  mais  dans  certains  cas,  le  nombre  des  langues  devient  pryoccupant.  Ainsi  la  Communauty 
Economique  Europyenne  (C.E.E.),  qui  regroupe  douze  pays,  fonctionne  avec  neuf  langues  officielles,  et 
consacre,  en  consyquence,  plus  de  la  moitiy  de  ses  dypenses  de  fonctionnement  &  1 'activity  traduction 
et  interprytariat.  Cette  activity  concerne  plus  du  tiers  des  agents  permanents  de  la  Commission  des 
Communautys  Europyennes  (C.E.E.)  et  plus  de  la  moitiy  de  ceux  du  Parlement  Europyen  ! 

MSme  si  cette  mycanique  de  traduction  systymatique  en  neuf  lar.oues  peut  apparaltre  comne 
monstrueuse  -  voire  complbtement  dybile  -  il  me  semble  extrgmement  important  de  suivre  Involution  de 
cette  expyrience  car  elle  porte  justement  sur  une  dizaine  de  langues,  c'est-J-dire  le  m&ne  nombre  que 
celui  des  grandes  langues  rygionales  du  globe.  Ne  serait-ce  pas  encore  une  fois  une  expyrience  d'avant  - 
garde  tentye  par  la  "Vieille  Europe",  en  avant-premifere  et  en  vraie  grandeur,  de  ce  qui  se  passera  dans 
le  monde  entier  ? 
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JS  U  M  M  A  R  Y 

first  chapter,  describing  the  relative  importance  of  languages  in  the  world  in  1 987,  shows  that  the  -.vcild  population  makes 
use  of  about  ten  languages,  each  of  which  covers  an  area  of  the  globe.  The  trends  are  revealed  by  an  analysis  of  population 
density  and  socio-economic  balances. 

The  use  of  languages  is  studied,  in  the  second  chapter,  in  the  field  of  science  and  Technology,  according  to  the  type  of 
communication  (written  or  spoken)  and  (he  contents  (from  mathematics  to  technology)  and  the  tendencies  are  shown. 

A  third  chapter  is  devoted  to  the  attempts  which  have  been  made  to  overcome  the  linguistic  barrier  —  such  as  the  introduction 
of  a  single  common  language  (esperanto,  English)  or  the  development  of  the  study  of  foreign  languages.  The  problems  of 
translation,  linguistics  and  the  computer  are  not  dealt  with. 

The  detailed  analysis  of  the  cultural  barrier,  in  the  fourth  chapter,  is  carried  out  by  several  approaches  (sociological, 
psychosociological,  psychoanalytic  and  psycholinguistic)  in  order  to  reveal  the  guenine  individual  and  collective  stakes. 

Lastly,  a  last  chapter  concentrates  on  future  trends,  such  as  the  working  of  the  Commission  of  the  European  Communities  with 
its  nine  official  languages,  and  to  put  forward  some  predictions  on  the  linguistic  state  of  the  world  at  the  beginning  of  the  2 1st 
century.^ 
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I  RELATIVE  IMPORTANCE  OF  LANGUAGES  IN  THE  WORLD  IN  1 987. 

1 . 1  Current  situation 

Since  language  has  appeared,  the  world  population  has  always  communicated  between  them  by  local  languages,  or  even 
dialects  which  were  only  comprehensible  by  relatively  few  people  setting  up  a  cultural  regional  group. 

The  twentieth  century  saw  the  combination  of  several  phenomena:  boom  of  demography,  lengthening  of  life  expectancy, 
increase  in  trade  and  development  of  technology,  which  led  the  five  thousand  million  people  living  on  earth  in  1987,  to 
communicate  at  several  levels. 

At  the  local  level,  language  —  or  dialect  —  remains  the  communication  tool  of  the  cultural  group  of  an  area,  but  with  new 
characteristics:  the  number  of  inhabitants  of  certain  areas  has  increased  considerably  during  the  twentieth  century  (Algeria, 
Egypt,  Indonesia,  Pakistan, ...)  and  the  osmosis  between  cultural  language  and  a  State  led  to  a  membership  strengthening  of  this 
cultural  group. 

At  the  overall  level,  exchanges  of  any  kind  which  have  been  developed  during  the  last  decades  throughout  the  world  have  been 
preceded  in  fact  by  important  but  limited  exchanges  in  the  “large  areas"  of  the  world.  And  as  it  had  been  first  necessary  to 
understand  each  other  for  trading  or  fighting,  only  eleven  idioms  became  the  sole  communication  means,  for  one  of  these  areas: 


Mandarin 

China 

Indi 

India 

English 

North  America  and  Western 

Spanish 

South  and  central  America 

Arabic 

North  Africa  and  the  Middle 

Malay 

South-east  Asia 

Russian 

Eastern  Europe 

French 

Western  Europe  and  Africa 

Swahili 

East  Africa 

Japanese 

Japan 

Portuguese 

South  America 

1.2  Tendency 

The  whole  of  the  major  demographic,  socio-economic  and  military  balances  was  always  determining  for  the  importance  of 
language,  and  so  is  it  difficult  to  forecast  the  future.  Yet,  if,  for  the  next  century,  the  assumption  of  major  cataclysm  is  excluded 
(biological,  military,  geological  or  cosmic),  the  outstanding  factor  is  demography  and  it  reveals  deep  changes  in  the  importance 
of  the  major  wordls  languages. 

Then  the  most  outstanding  facts  for  the  fifty  years  to  come  are: 

—  Stabilization  of  certain  languages:  Mandarin.  Indi,  French,  Russian,  Japanese, 

—  Growth  of  Swahili,  Malay  and  Portuguese, 

—  Rapid  growth  of  Spanish  and  the  even  more  rapid  growth  of  Arabic  which  could  be  used  more  than  Mandarin  in  the 
middle  of  the  next  century. 

To  some  extent,  the  share  of  currently  very  important  languages  in  number,  like  Mandarin,  Indi  and  English  will  decrease  to  the 
profit  of  Spanish  and  above  all  Arabic. 

In  addition,  political  strategic  choices,  for  example,  Mustapha  Kemal  ATATURK’s  decision  in  1928  to  impose  the  Roman 
alphabet  in  Turkey  or  the  decision  to  use  the  Cyrillic  alphabet  for  USSR  Eskimo  are  particularly  important  as  far  as 
exemplarity  is  concerned  and  of  very  limited  importance  as  far  as  the  quantitative  level  of  languages  evolution  is  involved. 

2  THE  USE  OF  LANGUAGES  IN  1 987  IN  THE  FIELD  OF  SCIENCES  AND  TECHNIQUES. 

2. 1  Need  for  international  scientific  communication: 

For  a  country,  the  broad  cultural  aims  of  scientific  and  technical  information  were  synthesized  in  a  circular  (6)  from  the  French 
Minister  of  Research  in  September  1981: 

—  to  express  the  scientific  and  cultural  identity  of  people, 

—  to  make  modernity  available  to  the  people,  in  their  own  language, 

—  to  ensure  democratic  equality  to  knowledge  through  the  circulation  of  scientific  and  technical  information  within  the 
social  fabric  of  a  nation. 

This  third  idea  being  just  a  new  formulation  of  ALAIN’s  philosophical  concept  when  he  stated  that  “the  cultural  level  of  a 
country  is  not  determined  by  that  of  Plato  but  by  that  of  his  maid". 


For  a  research  organisation,  this  international  communication  is  also  necessary  for  promoting  its  quality  (its  image)  or  more 


simply  for  justifying  the  use  of  the  allocated  funds.  We  all  know  that  the  major  international  periodic  congresses  give  in  fact  the 
opportunity  to  the  research  organisations  to  publicize  their  work. 

Finally  at  the  individual  level,  being  published  is  a  must,  despite  the  reality  of  the  “invisible  college”  composed  of  people  sharing 
the  same  research  preoccupation,  for  the  anteriority  problem  of ten  arises,  as  well  as  merely  the  consolidation  of  the  personal 
researcher’s  prestige;  in  fact,  the  publish  or  perish  dilemma  has  been  experienced  for  several  decades  and  is  deeply  rooted  in  all 
scientists’  heads. 

In  any  event,  science  and  technology  are  the  two  fields  in  which  intellectual,  technological  and  financial  investments  are  the 
largest,  and,  as  a  whole,  this  is  where  the  need  for  best  information  is  most  valuable.  In  addition,  a  DOE  report  (7)  on  the  impact 
of  scientific  and  technical  information  show  that  investment  in  STI  is  particularly  profitable  for  research. 

2.2  —  Type  of  communication:  spoken,  written: 

All  the  analyses  ( 1 ,  2,  3,  4,  5, 6)  reveal  that  English  is,  in  fact,  the  most  practical  common  language  used  by  most  scientists. 

It  is  currently  the  most  widespread  language  used  among  the  scientists  and  technicians  of  the  whole  world,  and  it  stands  in  the 
group  of  languages  which,  in  the  field  of  the  publications  quality,  are  the  most  interesting  ones. 

Since  English  is  currently  the  common  language  of  scientific  communciation,  this  brings  a  fundamental  difference  between 
English  language  scientists  and  the  others. 

With  regard  to  oral  communication  (3, 4),  and  particularly  during  meetings,  congresses  or  conferences,  English-speakers  have 
the  great  advantage  of  the  language  over  non  English-speakers  who  are  the  great  losers  as  far  as  communication  at  all  levels  is 
concerned,  that  means  the  message  obviously,  but  more  especially  the  level  of  emotional  relation:  the  loss,  would  say 
MACLUHAN,  is  double,  both  on  the  "message”  and  the  ’’massage". 

This  phenomenon  is  still  amplified  by  the  fact  that  many  English  or  American  scientists  understand  only  English  and  that  they 
impose  to  some  extent,  to  the  whole  of  a  group,  the  choice  of  English  language  with  which  they  feel  at  ease.  We  all  have  in  mind 
cases  in  which  a  group  discussing  in  German,  French  or  Spanish  start  speaking  English  at  the  coming  of  only  one  English- 
speaker.  to  the  major  disadvantage  of  all  the  others*. 

With  regard  to  written  communication  (3,  4).  the  situation  is  reversed  and  turn  to  the  advantage  of  non  English-speakers. 

Indeed  (3),  12  %  of  English-speakers  can  read  French  and  4  %  German,  whereas  97  %  of  French-speakers  and  German- 
speakers  can  read  English.  This  enables  therefore  non  English-speaking  scientists  to  access  literature  written  in  English  which 
represents  50  %  of  the  publications  production. 

This  result  is  corroborated  by  the  analysis  of  “Science  Index  Quotation”  which  shows  that  90  %  of  documents  written  in  English 
are  quoted  by  English-speaking  authors  (for  a  production  of  50  %)  and  the  possible  difference  in  quality  cannot  explain  such 
difference,  especially  when  one  considers  the  two  following  opposite  phenomena;  Firstly,  though  most  international  reviews 
are  in  English,  the  documents  bought  by  libraries  in  another  language  are  better  selected  (owing  to  the  difficulty  of  access)  and 
therefore,  undoubdedly,  of  a  better  quality  and  are  more  important  to  read. 

A  rather  remarkable  point  of  the  written  communication  which  has  not  been  sufficiently  pointed  out  in  my  opinion,  is  that 
certain  specialists  of  the  reading  Committee  of  an  important  international  review  are  informed  of  the  contents  of  the  article  six 
month  before  the  other  readers,  as  if  information  was  aimed  at  the  best  informed. 

2.3  —  Type  of  contents: 

In  sciences  and  techniques,  the  linguistic  situation  is  not  the  same  from  one  field  to  the  other. 

If  we  take  the  example  of  the  French  scientists  (3),  we  note: 

in  mathematics,  physics  and  chemistry,  more  than  half  of  the  French  authors'  articles  are  written  in  English  and  this 
percentage  has  increased  steadily  for  the  last  years; 

—  in  the  “applied  sciences”  (biology,  earth  sciences,  medicine  and  technology),  one  article  out  of  three  is  written  in  English, 
and  the  rise  in  publications  written  in  English  seems  to  to  have  stopped. 

In  addition,  we  note  that  the  share  of  Japanese  language  rises  quickly,  particularly  in  technology.  A  report  written  in  1 986  (8) 
gave  an  estimated  number  of  600  000  scientific  and  technologic  publications  a  year,  of  which  less  than  3  000  are  fully  or  partly 
translated. 

These  two  examples  justify  the  following  report:  publications  in  fundamental  sciences  are  directed  towards  English  writing, 
whereas  publications  in  applied  sciences  are  essentially  written  in  national  language. 


*  I  remember  a  negotiation  meeting  in  Brussels  assembling  9  persons  from  6  countries:  8  coming  from  5  countries  usually  speaking  French 
and  one  American  speaking  only  English, 


2A-4 


2.4  —  Tendency: 

As  information  is  more  vital  in  science  and  technology  than  in  any  other  fields,  pure  and  simple  giving  up  of  the  document  in  a 
foreign  language  is  less  frequent,  but  is  still  alarming.  A  survey  concerning  English  scientists  (2)  showed  that  in  1979,  29  % 
remained  passive  in  front  of  a  French  reference,  whereas  84  %  stated  that  they  had  a  working  knowledge  in  French  (this 
passivity  reached  46  %  in  social  sciences). 

In  addition,  the  analysis  of  the  500  GOO  reports  on  energy  held  in  Saclay.  coming  from  45  countries,  shows  that  they  have 
been  collected  for  thirty  years,  because  written  in  so-called  “accessible"  language,  i.e.  primarily  in  Romance  and  Cyrillic 
alphabet. 

So  the  rule  often  seemed  to  be  the  pure  and  simple  giving  up  of  the  “difficult  to  reach"  document. 

But  we  do  know  now,  owing  to  a  survey  (7)  concerning  60  000  engineers-researchers  subsidized  by  US.DOE  that  a 
"researcher"  reads  on  average  one  document  a  day  (220/year). 

This  figure  of  “one"  document  read  a  day  and  by  individual  has  little  chance  to  rise,  whereas  the  number  of  annual  publications 
increases  (8). 

As  the  unbalance  of  the  system  is  obviously  increasing,  the  solution  of  giving  up  the  "difficult  to  reach"  document  appears  more 
and  more  risky  and  the  adaptations  will  be  therefore  inescapable. 

It  stands  to  reason  that  the  quantity  of  documents  will  be  a  great  difficulty,  even  if  all  documents  were  written  in  the  same 
language.  This  problem  will  be  probably  tackled  by  both  ends. 

—  at  reading:  to  select  the  documents  better; 

—  at  writing:  to  write  shorter  and  less  often*,  since  in  this  matter  authors  and  readers  are  the  same  people. 

3  ATTEMPTS  TO  OVERCOME  THE  LINGUISTIC  BARRIER. 

3. 1  Foreign  languages  study: 

The  major  socio-economic,  political,  religious  and  military  balances  have  always  determined  the  choice  of  studying  a  second 
language. 

For  a  long  time,  only  one  language  was  enough  to  be  aware  of  what  was  going  on  in  the  world.  Latin  until  the  16th  century  and 
French  until  19th. 

Then,  and  until  1950,  three  Romance  languages  (French,  German.  English)  were  enough  to  be  informed  of  science  and 
technology,  it  was  therefore  still  possible. 

Since,  it  has  been  necessary  to  add  a  Cyrillic  language:  Russian,  and  today,  an  ideogrammatic  language.  Japanese. 

Tomorrow,  it  will  be  necessary  to  add  another  ideogrammatic  language,  Chinese,  and  two  other  Romance  languages.  Spanish 
and  Portuguese;  and  after  tomorrow,  Arabic  language. 

So  the  study  of  all  foreign  languages  to  be  used  in  scientific  communication  is  not  the  right  way,  because  concerning  the  offer, 
the  situation  has  worsened  very  much  since  only  Fnglish  and  French  are  still  being  taught  in  most  of  the  countries  of  the  world, 
but  this  is  not  true  for  Russian,  Chinese,  Japanese  and  Arabic  and  is  not  true  any  more  (6)  for  German  and  Spanish. 

Two  other  solutions  would  be  therefore  that  everyone  speaks  the  same  language  or  that  all  scientists  learn  the  same  “second 
scientific  language"  coded  as  algebra,  and  precise  as  the  unit  systems. 

3.2  A  common  international  language. 

The  most  radical  solution  would  be  of  course,  the  creation  of  an  international  language  which  would  eliminate  the  linguistic 
barrier  definitively  and  it  is  to  reach  this  aim  than  the  esperanto  was  created.  Although  the  operation  "esperanto"  has  been  of 
importance ^  (5),  it  has  remained  however  at  a  development  stage  too  weak  so  that  the  investment  necessary  to  its  study  be 
profitable  for  the  individual.  It  has  to  be  said  that  the  development  of  esperanto  has  been  concomitant  to  the  demographic 
boom  and  while  a  few  million  people  learned  esperanto,  a  few  hundred  millions  Chinese  learned  Mandarin  “naturally". 

The  problem  of  a  common  language  is  slightly  the  same  as  telephone  or  mail  publishing:  the  large  number  of  subscribers  plays  a 
major  part,  quality  is  a  little  the  by-product  of  the  quantity.  A  universal  common  language  remains  therefore  strongly 
improbable  in  the  close  future,  for  it  would  require  the  simultaneous  learning  of  this  language  by  two  or  three  thousand  million 
men. 


*  A  NTIS  report  is  read  on  average  9  times,  whereas  some  (best  sellers)  are  sold  50  000  times ..  There  is  therefore  a  few  ones  which  were 
written  for...  nobody. 

t  With  100  newspapers  and  20  000  publications  a  year,  the  esperanto  is  used  by  a  few  million  people  (1  to  8)  and  its  annual  Congress 
assembles  4  500  people  coming  from  50  countries. 
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3.3  International  language  for  scientific  and  technical  communication. 

English  is  the  mother  tongue  of  250  million  men  and  is  the  second  language  of  othei  250  million  men,  but  the  latter  are  not 
distributed  randmonly  among  human  activities.  They  are  concentrated  in  spheres  of  activity  with  high  communication  density: 
primarily  trade  in  which  70  %  of  the  international  —  mainly  commercial  —  mail  is  in  English,  but  also  radio  and  television, 
transport  (aviation  and  marine),  and  research  and  development  in  science  and  technology. 

Today  75  %  of  the  secondary  school  students  in  the  non  English  world  (except  China)  learn  English. 

Therefore  English  can  be  regarded  as  the  common  language  inof  scientific  and  technical  communication  and,  particularly  in  the 
scientific  field  where  50  %  of  the  literature  is  in  English,  one  can  consider  that  the  linguistic  barrier  is  a  problem  now  out  of  date, 
even  though  this  may  be  provisional. 

Indeed,  the  quantity  of  scientists  who  really  speak  English  is  likely  to  stabilize  rather  quickly  because  as  the  number  of  students 
who  will  go  in  English  language  universities  (for  studies  or  research)  will  not  grow  indefinitely,  the  proportion  of  these  real 
bilingual  persons  will  decrease. 

4  CULTURAL  BARRIER. 

4. 1  Various  approaches. 

Without  trying  to  define  culture,  it  is  quite  clear  that  the  combination  of  geographical  aspects  (earth,  sea.  plain,  mountain.  ..) 
and  climatic  aspects  (temperature,  hygrometry, ...)  has  played  a  major  part  in  the  initial  constitution  of  the  "cultural  groups"  of 
our  species. 

Then,  the  appearing  of  one  language  common  to  a  group  has  allowed  the  culture  of  a  group  to  be  transmitted,  but  also  society  to 
be  structured  and  thus  perenniality  to  be  ensured. 

The  speech  has  brought  new  distribution  of  the  power  in  society  and  the  appearing  of  w  riting  led  again  to  new  redistribution  of 
this  power.  —  The  pharaon  of  this  time  was  well  right  to  become  angry'  and  to  tear  his  hair  w  hen  this  invention  was  presented. 

The  technological  development  of  this  century  has  so  great  an  impact  on  peoples'  cultural  life  that  its  effects  cannot  be 
ascertained  yet.  If  certain  technological  projections  are  well-known,  may  be  others  are  also  sociologically  important:  to  pour 
chlorine  into  the  rivers  is  perhaps  a  factor  as  important  as  penicillin  for  life  expectancy;  the  religions  arc  led  to  take  into  account 
the  synthesis  of  elements  by  stars  and  the  expanding  universe,  and  nylon,  via  sticking  which  allowed  the  mini-jupe.  probably 
took  part  in  the  Women's  Lberation  in  certain  societies. 

Psychologically,  the  function  of  language  is  not  solely  to  satisfy  the  need  of  recognition  of  Man.  but  also  his  need  of 
consideration.  In  fact  language  strengthen  the  feeling  of  belonging  to  a  group  and  conversely  the  need  of  collusion  with  a  group 
—  to  place  in  relation  to  other  groups  —  favours  language  mobility  tow  ards  a  dialect  or  even  slang  w  hich  changes  itself  w  hen  the 
group  realises  that  it  spreads  in  the  other  groups. 

As  soon  as  the  approach  of  the  cultural  barrier  is  somewhat  psychoanalic.  one  realises  the  leading  role  played  by  the 
unconscious.  Two  American  surveys  reported  in  “The  foreign  Language  Barrier"  (5)  show  that  both  determining  in  exploiting 
Soviet  literature  were  the  familiar  knowledge  of  U.R.S.S.  and  the  good  opinion  of  research  in  Soviet  Union,  and  those  who  do 
not  have  any  good  opinion  of  this  research  never  quote  a  Soviet  author. 

At  the  pure  linguistic  level,  if  the  importance  of  paradigms  is  easy  to  demonstrate  (Arabic  has  more  than  one  hundred  words  to 
indicate  "camel”  whereas  there  is  none  to  indicate  "table”  which  was  an  unknown  object),  the  syntagmatic  function  is  quite  as 
important  for  it  structures  thought,  sometimes  in  particular  instances.  —  For  some  languages  which  do  not  have  the  future  (or 
the  past),  the  impact  on  thought,  religion  and  philosophy  are  considerable.  — 

The  human  brain  which,  once  adult,  mainly  functions  through  associations  and  images  (phantasms)  develops  in  the  very  early 
childhood  both  by  the  language  and  thought;  it  is  during  this  growth  phase  that  thought  structures  the  language  but  also  the 
latter  structures  the  former.  The  result  of  this  process,  which  is  of  a  cultural  nature,  influences  the  scientific  and  technological 
approaches.  —  The  structural  conception  of  a  number  of  Japanese  computers  is  impregnated  with  Japanese  culture. 

4.2  Stakes. 

4.2,1  C  ullural  stake. 

To  each  technical  innovation,  the  quantity  of  information  which  has  been  necessary  to  carry  out  an  object  has  increased.  For 
example,  the  realization  cost  of  the  film  TRON  —  or  of  the  computer  LISA  —  is  50  M.$,  i.e.  200  men/years. 

Immaterial  namely  intellectual  investments  will  become  more  and  more  important  to  manufacture  new  products,  as  if  the 
production  society  were  being  transformed  into  a  creation  society. 

The  major  technology  of  the  2 1  st  century  will  be  that  of  information  in  every  aspect. 
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Each  technical  innovation  has  always,  more  or  less  quickly,  determined  the  balance  of  power  between  nations  and  their  way  of 
life.  This  was  particularly  true  for  a  long  period  of  time  with  weapons,  then  with  consumer  goods.  Now  this  is  true  for 
'communication  goods”. 

It  is  symbolic  that  the  first  two  biggest  computers  in  the  world  (two  CRAY  XMP’s)  were  bought,  by  the  D.O.D,  (Department  of 
the  Defence,  U.S.A.)  and  by  a  small  U.S.  show  business  company  ("Digital  Production"),  employing  about  thirty  persons  and 
specialized  in  image  synthesis.  Actually,  it  is  as  if  the  power  and  the  prevalence  of  the  states  were  at  stake  on  two  fronts,  passive 
war:  weapons,  and  active  war:  culture. 

4. 2. 2  Stake  of  language. 

There  is  no  doubt  about  it:  a  language  is  neither  neutral,  nor  innocent;  within  this  new  torm  of  economic  and  cultural  warfare,  it 
it  one  of  the  most  important  stakes. 

It  is  probably  why  the  dream  of  a  language  common  to  everybody  on  earth  has  but  a  few  chances  of  succeeding  in  the  near 
future. 

Additionally,  this  unique  solution  is  probably  not  desirable  since  it  entails  three  risks: 

—  cultural  impoverishment  due  to  uniformity. 

—  unique  structuring  of  the  w  ay  of  thinking, 

—  degeneration  of  the  selected  language. 

Moreover,  this  would  not  lead  to  world  peace  (at  present  most  of  the  belligerents  throughout  the  world  speaks  the  same 
language),  nor  even  to  a  better  effectiveness  in  communication  (the  way  of  communicating  is  not  more  powerful  in  the  major 
multinational  corporations). 

5  PROMISING  FACTS  ON  THE  LINGUISTIC  SITUATION  OF  THE  WORLD  AT  THE  BEGINNING  OF  THE  21st 
CENTURY. 

The  linguistic  condition  of  the  world  in  the  21th  century  will  be  as  follows: 

—  at  the  level  of  the  "membership  group",  "national",  rich,  sociocultural  languages,  representing  the  identity  of  the  people. 
These  languages  ensuring  the  major  relations  between  men  will  remain  very  numerous; 

—  at  the  regional  level,  about  ten  —  perhaps  six  —  simple  languages  to  communicate  within  each  large  world  area; 

—  at  the  world  level,  one  or  two  overcoded  but  poor  common  languages,  will  allow  information  to  be  exchanged  at  the 
overall  "economic"  level. 


But,  before,  a  few  remarks  are  necessary: 

—  to  really  participate  in  this  membership  group,  one  will  have  to  learn  the  language  of  this  group; 

—  to  have  an  international  action,  each  one  will  have  to  endeavour  to  learn  both  the  regional  and  international  language; 

—  it  is  always  the  need  —  or  the  wish  —  which  makes  it  possible  to  overcome  the  linguistic  barrier,  the  PASCAL  base  of 
C'.N.R.S.  is  more  used  by  Japan  than  by  the  other  European  countries  which  use  however  Romance  languages; 

—  communication  in  applied  sciences  —  "  technology  ”  —  will  be  done  more  and  more  in  the  national  membership  language 
(8  (because  it  willcreasingly  concern  the  whole  population  of  the  group,  and  perhaps  also,  in  some  cases,  for  reason  of  industrial 
confidentiality. 

Until  comes  the  miraculous  tool  with  which  one  expresses  vocally  in  his  membership  language,  whereas  the  interlocutor  hears 
directly  in  his  language  —  and  reciprocally  —  a  number  of  things  can  be  attemped  to  reduce  the  linguistic  barrier: 

—  with  regard  to  written  communication,  a  way  could  be  the  systematic  —  and  full  —  translation  of  the  reviews  in  about  ten 
languages;  the  first  step  towards  this  systematic  translation  could  concern  the  translation  of  secondary  reviews'  abstracts  so  that 
a  Russian  or  Chinese  document  can  be  found  automatically,  whatever  the  language  into  which  the  summary  was  translated; 

—  with  regard  to  oral  communication,  the  first  objective  to  be  reached  is  the  man's  familiarization  to  multilingualism  (at  least 
two),  and  the  habit  of  using  it,  this  step  being  particularly  important  for  English  mother  tongue  speakers.  May  be  one  of  the  most 
skilful  solutions  (6)  will  be  to  use  systematically  two  languages  for  any  speech  namely  the  membership  language  for  talks  and 
the  regional  language  (or  international)  for  planned  written  information;  or  of  course  the  reverse,  according  to  the 
circumstances  and  the  speaker’s  linguistic  performance. 

A  second  aim  to  be  reached  would  be  perhaps  the  awakening  of  duality  purity/richness.  Indeed  all  the  “advocates"  of  a 
"national"  language  tend  to  defend  essentially  the  purity  of  language,  to  the  detriment  of  its  richness  and  its  life.  In  my  opinion 
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an  exeriment  to  be  followed  is  that  of  the  "Portugnol"  —  as  South  American  name  it  —  which  is  a  mixture  of  Spanish  and 
Portuguese  with  no  sentences  alteration  in  both  languages. 

Another  experiment  to  be  followed  is,  obviously,  that  of  the  international  organizations. 

They  function  generally  with  a  small  number  of  official  languages,  two  for  the  International  Energy  Agency  ( A.I.E.),  four  for  the 
International  Agency  for  Atomic  Energy  ( A.I.E.A.),  etc...,  but  in  some  cases,  the  number  of  languages  becomes  alarming.  Thus 
the  European  Economic  Community  (C.E.E.),  which  gathers  twelve  countries,  functions  with  nine  official  languages,  and 
devotes,  accordingly,  more  half  of  its  administrative  expenditures  to  the  translation  activity  interpretership.  This  activity 
concerns  more  third  of  the  permanent  agents  of  the  Commission  of  European  Communities  and  more  than  half  of  those  of  the 
European  Parliament. 

Even  if  this  systematic  translation  procedure  into  nine  languages  can  appear  monstrous  —  or  even  completely  weak  —  it  seems 
to  me  very  important  to  follow  the  evolution  of  this  experiment  for  it  covers  precisely  about  ten  languages,  i.e.  the  same  number 
as  that  of  the  major  regional  languages  on  earth.  Would  not  it  be  once  more  an  avant-garde  full-scale  experiment  attempted  by 
the  “Old  Europe",  as  a  prefigurement  of  what  will  occur  in  the  whole  world  ? 
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SUMMARY 

^As  international  computer /communicat ions  networks  proliferate,  the  issue  of  limits  to  information 
transfer  becomes  more  critical.  Individual  businesses  and  even  entire  industries  depend  upon  the  ability 
to  transfer  information  on  a  global  basis  in  a  timely  and  cost-effective  basis.  Additionally,  international 
cooperation  in  science  and  technology  involves  greater  international  collaboration  and  joint  efforts. 
Barriers  that  threaten  this  capability  ultimately  reduce  the  growth  of  markets  and  limit  innovation.  For 
this  reason,  efforts  are  underway  in  bilateral  and  multinational  negotiations  to  reduce  barriers  and 
establish  consensus  guidelines  in  areas  ranging  from  privacy  to  trade.  This  paper  focuses  on  some  of  the 
specific  limitations  to  information  transfer,  including  privacy,  trade  restrictions,  national  security, 
and  telecommunications  regulations.  It  also  highlights  international  efforts  to  reduce  these  barriers  and 
establish  common  approaches  to  facilitate  information  transfer. 
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INTRODUCTION 

The  trend  toward  global  endeavors  in  business,  science,  and  the  arts  is  facilitated  by  the  growth  of 
international  computer /communicat ions  networks.  Intracorporate  networks  allow  for  coordination  of  research, 
production,  and  marketing  functions  that  increasingly  are  dispersed  throughout  the  world.  The  services 
industries,  such  as  banking  and  financial  services,  use  computer/comrounicat ions  networks  to  transfer  infor¬ 
mation  instantaneously  on  a  worldwide  basis.  (1)  World  markets  hav*>  become  vastly  important  to  industries 
that  previously  served  only  domestic  consumers.  Similarly,  the  scientific  community  is  taking  advantage  of 
the  new  information  technologies.  They  use  computers  and  communications  networks  for  a  wide  range  of 
activities.  These  applications  include  gathering  data  via  computerized  instruments,  using  computer  models 
to  simulate  experiments,  performing  massive  calculations  using  supercomputers,  accessing  databases  via  remote 
terminals,  and  exchanging  scientific  results  through  communication  networks.  (2) 

Various  political,  legal,  and  economic  barriers  limit  the  full  realization  of  these  technologies  and 
the  benefits  that  accrue  from  their  use.  To  a  certain  extent  these  barriers  result  from  a  world  in  which 
institutions  have  been  unable  to  keep  pace  with  the  rapid  advances  in  technological  development.  They  also 
demonstrate  the  difficulties  of  making  the  transition  to  the  "information  age"  in  which  information  tech¬ 
nology  plays  a  critical  role.  In  some  instances,  they  represent  attempts  by  governments  to  control  the 
pace  of  technological  change.  In  others,  they  reflect  a  desire  to  protect  domestic  industries  from  foreign 
competitors  or  to  protect  national  security  against  adversaries.  Different  legal  systems,  along  with 
diverse  political  and  economic  environments,  also  contribute  to  conflicting  policies  for  information  transfer 
between  countries. 

The  growing  recognition  of  the  value  of  access  to  information  has  prompted  an  examination  on  both 
national  and  international  levels  of  existing  barriers  and  potential  conflicts  of  laws  and  regulations. 
Efforts  focus  on  harmonization  of  differing  legal  approaches  to  establish  mutually  agreeable  guidelines  for 
reducing  barriers.  Key  to  the  success  of  these  efforts  is  the  development  of  a  consensus  on  guidelines  that 
provide  for  adherence  to  common  principles,  while  retaining  flexibility  in  their  application. 

PRIVACY  PROTECTION 

One  of  the  earliest  manifestations  of  the  growth  of  computerized  information  sys1?*ms  was  a  concern 
for  the  protection  of  personal  privacy.  As  electronic  databases  proliferated  and  remote  access  became 
commonplace,  the  need  for  effective  safeguards  for  confidentiality  of  personal  data  emerged  as  a  priority 
for  many  nations.  In  response  to  the  potential  threats  posed  by  computer/communications  systems,  a  number 
of  nations  enacted  privacy  or  data  protection  laws.  Most  of  the  European  laws  covered  only  automated  files, 
while  in  the  United  States  privacy  protection  is  linked  to  the  individual  rather  than  the  medium  in  which 
personal  information  is  stored.  Generally  the  new  data  protection  laws  required  registration  of  public 
and  private  databases  containing  personal  information,  as  well  as  rights  of  access,  notice,  and  correction. 

In  some  instances,  these  laws  extend  beyond  individuals  to  cover  "legal  persons"  (corporations  or  institu¬ 
tions)  as  well.  (3) 

In  the  late  1970s  and  early  1980s  concern  arose  over  the  potential  abuse  of  these  laws  to  protect 
domestic  industries,  rather  than  strictly  to  provide  individual  privacy  protection.  Also,  multinational 
corporations  became  alarmed  over  the  necessity  to  adhere  to  differing  privacy  laws  in  different  countries. 
There  was  a  call  for  developing  some  consensus  on  basic  privacy  principles  that  would  establish  common 
ground  rules,  at  least  among  the  industrialized  nations.  The  Organization  for  Economic  Co-operation  and 
Development  (OECD)  provided  the  forum  for  establishing  agreement  on  these  fundamental  privacy  principles. 

In  1980,  the  OECD  Guidelines  on  the  Protection  of  Privacy  and  the  Transborder  Flows  of  Personal  Data  were 
completed.  (4)  At  the  same  time,  the  Council  of  Europe  (COE)  developed  Its  own  Convention  for  the 
Protection  of  Individuals  with  regard  to  Automatic  Processing  of  Personal  Data  (5)  for  its  member  states. 

Both  the  OECD  Guidelines  and  the  COE  Convention  limit  the  collection  of  personal  data,  restrict  disclosure 
to  authorized  purposes,  and  allow  for  individuals  to  inspect  and  challenge  their  records.  The  COE 
Convention,  however,  must  be  implemented  through  privacy  legislation  and  may  be  applied  to  legal  persons. 
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One  of  the  key  elements  of  the  OECD  Guidelines  is  the  flexibility  with  which  countries  can  demonstrate 
adherence  to  its  principles.  For  example,  in  the  United  States  there  is  no  omnibus  privacy  law,  but  rather 
a  tapestry  of  federal  and  state  laws,  which  in  conjunction  with  certain  constitutional  rights  establish 
privacy  protections.  (6)  Complementing  this  is  the  significant  number  of  companies  and  other  institutions 
that  maintain  internal  privacy  regulations.  Establishment  of  these  guidelines  has  eased  concerns  over  both 
privacy  protection  and  problems  of  operating  in  several  different  countries  with  different  legal  and 
regulatory  regimes.  Generally  multinational  corporations  have  indicated  that  privacy  laws  do  not  pose  a 
major  constraint  in  entering  new  markets  and  offering  services.  Although  some  concerns  remain  about  the 
potential  for  privacy  laws  to  limit  operations,  particularly  in  financial  services,  the  development  of 
consensus  guidelines  has  proved  beneficial  for  facilitating  international  information  transfer. 

ECONOMIC  AND  TRADE  BARRIERS 

Economic  and  trade  barriers  now  pose  greater  threats  to  the  transfer  of  information  than  privacy  laws. 

As  businesses  and  nations  compete  for  greater  portions  of  the  global  marketplace,  the  value  of  access  to 
information  becomes  more  critical.  As  indicated  earlier,  the  successful  operation  of  multinational  enter¬ 
prises  is  highly  dependent  on  effective  intracorporate  networks.  Further,  information  technology  makes 
possible  a  growing  array  of  new  products  and  services  that  can  expand  current  markets.  No  accurate  figures 
exist  for  the  value  of  world  trade  in  services,  but  estimates  exceed  $360  billion  and  continue  to  grow  at 
a  substantial  pace.  (7)  An  important  component  of  this  is  trade  in  telecommunications  and  computer  services, 
not  only  because  of  its  own  growth  potential,  but  also  because  it  is  critical  to  the  operation  of  other 
service  sectors. 

Economic  and  trade  barriers  to  these  activities  can  range  from  high  tariffs  to  incompatible  standards. 
Telecommunications  regulations  often  create  another  economic  barrier  to  international  computer /communica¬ 
tions  services.  These  barriers  restrict  not  only  the  growth  of  the  businesses  providing  computer/ 
communications  services,  but  also  the  activities  of  the  users  of  these  services.  (8)  A  number  of  competing 
and  conflicting  interests  arise  in  this  area.  How  can  intellectual  property  rights  be  secured,  while 
simultaneously  promoting  the  development  of  new  technologies?  How  can  effective  standards  for  products 
and  interconnection  be  established  without  inhibiting  innovation ?  How  can  access  to  telecommunications 
facilities  and  services  be  liberalized  while  maintaining  basic  transmission  capabilities?  How  can  new 
trading  rules  be  developed  for  services  that  will  promote  rather  than  discourage  market  expansion  and 
openness? 

Several  examples  may  demonstrate  how  economic  and  trade  barriers  create  obstacles  to  the  transfer  of 
information  internationally.  In  some  countries,  there  are  restrictions  on  the  use  of  foreign  data  processing 
facilities  that  require  the  use  of  redundant  facilities  domestically.  There  also  may  be  requirements  to 
purchase  domestic  technology,  either  by  limiting  imports  or  by  restricting  the  type  of  equipment  that 
can  be  attached  to  the  public  telecommunications  network.  Restrictive  interconnection  standards  provide 
another  barrier  to  the  use  of  equipment  that  does  not  have  approval  of  a  country's  telecommunications 
authority.  Efforts  to  limit  or  deny  access  to  flat-rate  private  leased  lines  create  concerns  as  well  for 
suppliers  of  international  computer  and  information  services.  (9)  Tariff  structures  in  countries  with  tele¬ 
communications  monopolies  often  provide  a  further  disincentive  to  use  of  international  computer/connnunica- 
tions  networks.  Individually  and  collectively  these  economic  and  trade  barriers  limit  expansion  of  infor¬ 
mation  services  and  the  technologies  critical  to  delivering  them.  This  harms  not  only  the  providers  of 
computer /communications  products  and  services,  but  perhaps  more  importantly  their  user=. 

Recently,  there  is  evidence  of  a  significant  trend  toward  reducing  the  role  of  telecommunications 
monopolies  and  promoting  competition  in  certain  sectors.  Traditional  telecommunications  monopolies  are 
being  replaced  by  a  mixture  of  private  and  public  entities  providing  a  growing  range  of  facilities  and 
services.  This  reflects  both  technological  developments  and  changing  market  structures.  Today,  more 
technological  choices  exist  for  transmitting  information  than  ever  before.  Advances  in  satellites,  as  well 
as  the  spread  of  fiber  optic  cables,  vastly  increase  available  capacity.  Lower  circuit  costs  combined  with 
tremendous  reductions  in  computing  costs  make  it  possible  to  offer  more  services  on  a  worldwide  basis. 

The  globalization  of  markets  further  contributes  to  the  pressure  to  modernize  telecommunications  networks 
and  provide  innovative  services. 

The  United  States  took  the  lead  in  promoting  competition  in  telecommunications  services.  The 
development  of  competition  in  long-distance  services,  the  divestiture  of  AT&T,  the  establishment  of  an 
"open-sky"  policy  allowing  for  domestic  satellite  competition,  and  the  reduction  of  regulation  by  the 
Federal  Communications  Commission  (FCC)  all  contributed  to  an  environment  in  the  United  States  which 
promoted  greater  diversity  of  equipment  and  services,  along  with  lower  prices.  More  recently,  other 
nations  have  begun  to  revise  their  existing  regulatory  structure  to  allow  for  greater  competition.  In 
the  United  Kingdom,  a  second  carrier  (Mercury  Ltd.)  was  authorized  to  compete  with  British  Telecommunications, 
an  independent  regulatory  entity  was  established,  and  competitive  vrlue-added  services  licensed.  Japan 
also  privatized  certain  telecommunications  operations  and  now  supports  competition  in  various  value-added 
services.  (10)  The  French  Government  is  in  the  process  of  separating  the  regulatory  function  from  their 
telecommunications  network  supplier  and  allowing  competition  in  value-added  services.  There  is  evidence 
that  this  trend  toward  greater  competition  in  the  provision  of  telecommunications  services  will  continue 
in  other  countries.  This  should  result  in  wider  service  offerings,  lower  prices,  and  greater  Innovation. 

The  effectiveness  with  which  the  transition  is  accomplished  will  be  significant  for  the  user  of  telecommuni¬ 
cations  facilities  and  services  and  the  ability  to  transfer  information  internationally.  Currently,  the 
United  States  is  exerting  considerable  pressure  on  its  major  trading  partners  to  ensure  that  as  competition 
develops  in  these  markets,  there  is  equitable  access  for  U.S.  providers  of  computer/communicat ions  services. 
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national  security  concerns 

Advances  in  science  and  technology  are  critical  to  the  maintenance  of  national  defense.  As  a  result, 
there  are  legitimate  concerns  about  access  to  our  scientific  and  technical  information  that  might  compromise 
national  security.  However,  in  the  United  States  the  enormous  advances  in  science  and  technology  have 
resulted  in  part  from  an  open  exchange  of  information  among  scientists.  The  dilemma  of  how  to  strike  an 
appropriate  balance  between  openness  and  controls  on  scientific  and  technical  information  continues  to 
create  problems  for  policymakers.  (11)  In  recent  years,  the  problem  has  received  significant  attention  for 
a  number  of  reasons.  The  linkage  between  scientific  developments  and  modern  weapons  systems  is  even 
stronger  than  before.  The  scientific  enterprise  is  increasingly  global  in  scale,  supporting  greater  inter¬ 
national  collaboration.  A  number  of  laws,  regulations,  and  Government  policies  have  increased  controls  over 
the  dissemination  of  technical  information. 

This  has  led  to  considerable  public  debate,  in  which  the  scientific  community  generally  has  spoken  out 
in  favor  of  reducing  controls  on  the  exchange  of  scientific  information.  They  cite  a  number  of  incidents 
over  the  last  few  years  where  limits  were  placed  on  presenting  papers  at  scientific  meetings,  foreign 
scientists  were  denied  visas  or  not  allowed  to  join  collaborative  research  efforts,  and  prepublication  review 
of  scientific  findings  was  requested.  The  Department  of  Defense  (DOD)  sought  to  clarify  the  issue  by 
developing  guidelines  for  publication  of  DOD-sponsored  research.  They  stated  that  unclassified  "fundamental" 
research  performed  at  universities  and  Federal  laboratories  would  not  be  restricted.  Fundamental  research 
is  defined  on  the  basis  of  the  performer,  budget  category,  sponsoring  DOD  entity,  and  special  contractual 
provisions.  (12) 

Most  recently,  however,  the  information  industry  as  well  as  the  users  of  scientific  and  technical 
information,  became  concerned  about  the  creation  of  a  new  "sensitive  but  unclassified"  category  of  infor¬ 
mation.  In  1984,  President  Reagan  signed  National  Security  Decision  Directive  (NSDD)-145  to  improve  security 
of  Government  computer  systems.  As  part  of  this  effort,  a  new  National  Policy  on  Protection  of  Sensitive, 

But  Unclassified  Information  in  Federal  Government  Telecommunications  and  Automated  Information  Systems 
(NTISSP  No.  2)  was  issued  on  October  29,  1986.  This  policy  statement  had  the  potential  for  limiting  items 
to  be  entered  into  databases  controlled  both  by  the  Government  and  the  private  sector  and  raised  questions 
about  limiting  access  to  what  had  previously  been  publicly  available  information.  The  new  initiative 
received  considerable  criticism  in  congressional  hearings  (13)  and  subsequently  was  rescinded  by  the  National 
Security  Advisor. 

The  National  Academy  of  Sciences  (NAS)  also  explored  extensively  the  Issue  of  controls  over  disseminating 
scientific  and  technical  information  and  technology  transfer.  In  1982,  NAS  issued  the  report  Scientific 
Communications  and  National  Security  in  which  they  found  that  the  Soviet  Union  acquires  a  substantial  amount 
of  U.S.  technology  relevant  to  military  systems.  They  further  found,  however,  that  very  little  of  this 
transfer  came  from  universities  and  open  scientific  communication.  (14)  In  early  1987,  a  second  NAS  panel 
established  to  assess  the  effect  of  controls  on  commercial  activities  in  high  technology  issued  its  report. 
Balancing  the  National  Interest:  U.S.  National  Security  Export  Controls  and  Global  Economic  Competition. 

The  panel  found  that  it  is  as  important  to  maintain  the  Western  technology  lead  through  open  scientific 
communication  and  trade,  as  it  is  to  control  the  technology  flow  to  the  Soviet  bloc.  (15)  As  the  NAS  studies 
indicate,  this  is  an  extremely  difficult  question  which  requires  ongoing  assessment  to  ensure  that  neither 
national  security  nor  scientific  advancement  are  jeopardized  as  information  transfer  policies  are  developed 
and  implemented. 

EFFORTS  TO  REDUCE  BARRIERS 

There  is  a  growing  recognition  worldwide  of  the  importance  of  facilitating  information  flows  to  promote 
scientific  advancement  and  commercial  developments.  A  number  of  nations,  both  on  a  bilateral  and  multi¬ 
lateral  basis,  are  working  to  establish  agreements  to  reduce  barriers  to  information  flows.  In  1985,  the 
OECD  Council  of  Ministers  adopted  a  Declaration  on  Transborder  Data  Flows.  The  Declaration  promotes 
cooperation  on  access  to  information  and  the  avoidance  of  barriers  to  the  international  exchange  of  data. 

It  also  seeks  to  make  regulations  transparent  and  to  develop  common  approaches  for  dealing  with  transborder 
data  flow  issues.  (16)  The  Declaration  also  serves  as  the  basis  for  future  work  by  the  OECD  in  facilitating 
transborder  data  flows. 

A  major  effort  is  underway  in  the  Uruguay  Round  of  the  General  Agreement  on  Tariffs  and  Trade  (GATT) 
to  establish  rules  for  trade  in  services  similar  to  those  for  trade  in  goods,  as  well  as  to  improve  intel¬ 
lectual  property  rights  internationally.  The  results  of  these  negotiations  are  bound  to  have  a  significant 
impact  on  the  provision  of  computer/conanunications  services  internationally.  In  the  same  timeframe,  the 
World  Administrative  Telephone  and  Telegraph  Conference  (WATTC-88)  of  the  International  Telecommunications 
Union  (ITU)  will  establish  definitions  that  will  form  the  basis  of  regulations  for  international  tele¬ 
communications  services.  The  expansiveness  of  these  definitions  will  be  significant  for  determining  what 
computer/communications  services  fall  within  the  purview  of  international  telecommunications  regulations. 

As  indicated  earlier,  the  ability  to  provide  competitive  services  on  a  cost-effective  basis  is  important  for 
both  suppliers  and  users  of  information  technology.  Both  of  these  multinational  negotiations  will  establish 
an  international  regulatory  framework  that  may  either  promote  or  inhibit  international  information  transfer. 
For  these  reasons,  considerable  time  and  resources  are  being  expended  by  the  U.S.  Government  and  the  private 
sector  to  work  toward  a  successful  outcome  of  these  negotiations. 
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To  allow  to  compare  computer  aided  translation  (CAT)  and  machine  translation  CUT)  sys¬ 
tems,  essential  criteria  and  typical  exponents  of  the  various  concepts  are  presented. 
Among  the  most  important  criteria  of  differentiation  are  the  following: 


-  Translation  of  complete  texts  (including  titlBS,  abstracts)  vs  lexical  aid  for  trans¬ 
lation  , 

-  All-round  translation  vs  translation  of  specialized  fields  and  texts' 

-  Bilingual  translation  vs  multilingual  translation, 

-  User-oriented  vs  administrator-oriented  expansion  of  dictionaries  and  systems.  a-.J 

-  Portability.  ' 


Included  essential  criteria  for  evaluation  are: 

\  \ 

-  Duality  of  the  raw  (also  called  informative)  translation/ ■”  advantages  /  disadvantages 
as  opposed  to  human  translation, 

-  Embedding  in  a  user  environ  ent  (data  bases,  indexing  and  retrieval,  word  processing, 
electronic  publishing)  ■ 

-  User  friendliness  , 

-  Costs  of  development  and  installation  .i.  J 

-  Running  costs  and  effects  of  realization. 

( 

List  of  sumbols 


CAT 

EC 

EUROTRA 

LOGOS 

hlNITEL 

MT 

PC 

SUSY/STS 

SYSTRAN 


Ulang  OIS 
WordPerfect 


Computer  aided  Translation 
European  Community 

European  Translation  System,  research  project  of 
the  European  Community 

Commercially  available  Machine  Translation  System 
of  LOGOS  Corporation 

Uideotex  system  access  tool  to  the  French  TELETEL 
Machine  Translation 
Personal  Computer 

Saarbrucken  Translation  System  /  ServicB 
Commercially  available  Machine  Translation  SystBm 
of  World  Translation  Center,  U.S.A.,  and  Gachot 
S . A . ,  France 

Office  Information  System  of  Wang  Computers 
Commercially  available  editor 


Introduction 


Considering  the  technical  development  in  information  industry,  one  has  to  realize  that 
the  computer  will  be  integrated  in  nearly  every  professional  human  activity.  In  the 
Field  of  word  or  text  processing,  man-machine  interaction,  on  PC  level  or  on  the  base 
of  a  work  station  an  terminal  instead  of  a  typewriter,  will  be  the  standard  in  the  near 
future,  that  means  From  the  end  of  the  eighties. 

On  the  other  side,  the  problem  of  automatic  translation  of  natural  text  in  general 
even  not  to  speak  about  speech  understanding  and  speech  translation  -  will  not  be 
solved  in  the  sense  of  HQFT  Chigh  quality  fully  automatic  machine  translation),  due  to 
the  complexity  of  natural  languages.  So  there  arB  limits  (more  or  Ibss  in  the  scientif¬ 
ic  approach)  and  possibi 1 i ties  of  integrating  technical  tools  in  the  process  of  Chu- 
man)  translation  or  even  text  understanding  by  machines. 

The  aim  of  this  contribution  is  to  explore  the  possibilities  of  usage  of  the  computer 
in  the  field  of  machine  and  machine  aided  translation.  Even  if  it  might  be  of  some  at¬ 
traction  to  treat  also  principal  problems  of  translatability  of  texts  Cand  understand¬ 
ing),  we  will  concentrate  on  technical  translation,  that  means  the  translation  of  tech¬ 
nical  or  common  texts. 


Systematic  aspects 

Even  if  it  might  be  of  some  interest  how  machine  translation  is  realized,  the  linguis¬ 
tic  aspects  Cesp.  the  models  of  grammar  and  the  possible  or  used  strategies)  play  an 
subordinate  part.  So  one  can  assume  that  there  is  a  kind  of  "black  box”,  where  a  text 
or  words  in  one  natural  language  are  put  in  and,  with  or  without  human  interaction  one 
or  alternative  translations  come  out  of  the  system.  The  translation  might  be  gjod,  use¬ 
ful  or  bad  under  the  aspects  of  the  user . 


4-2 


Concentrating  on  the  usage  and  usefulness  of  machine  translation  or  computer  aidBd 
translation,  one  has  -  in  principle  -  to  distinguish  between  tuio  main  user  groups: 

£13  the  so-called  end  user,  e.g.  an  expert  trying  to  get  the  information  of  an  article 
written  in  a  natural  language  more  or  less  unknown  to  him;  a  person  writing  a  let¬ 
ter  to  a  friend  in  a  foreign  language  . . .  and 

£23  a  professional  agent,  esp .  a  human  translator  who  tries  to  use  the  machine  as  a 
tool  in  the  process  of  fulfilling  his  job. 

Under  theSB  aspects,  one  can  let  aside  projects  and  basic  re  search  an  the  principles 
of  machine  translation  /  language  understanding  and  orientate  on  practical  tools  TBsp. 
systems . 

As  a  starting  point,  it  is  important  to  distinguish  ^etween  two  main  strategies:  ma¬ 
chine  translation  £(1T3  and  computer  aided  translation  (CAT!. 

o  A  systBm  will  bB  called  a  machine  translation  system  £HT3  only  if  a  translation 
process  -  starting  from  the  machine  readable  source  text  -  is  fulfilled  without  any 
human  interaction  to  reach  a  target  text  quality  at  least  ’’good  enough”  for  infor¬ 
mation  purposes . 

a  A  system  will  be  called  a  computer  aided  translation  system  (CAT!  if  a  human  inter¬ 
action  is  needed  or  foreseen  to  reach  the  aim  of  a  "good”  translation  of  a  source 
text  (machine  readable  or  not  3 . 

It  is  quite  clear  that  a  fIT  system  can  bB  used,  in  addition,  as  a  component  of  a  CAT:  a 
text  can  be  adjusted  before  the  (IT  starts  to  get  better  machine  translations  £ ”pre-ed- 
iting’’3,  and/or  a  machine  translated  text  can  be  post-edited  by  human  translators  to 
get  higher  quality. 

There  are  a  lot  of  systems  on  MT  and  CAT  on  the  market  claiming  to  bB  the  right  practi¬ 
cal  tool,  and  the  choice  is  hard  to  be  made  without  having  precise  criteria  for  the  de¬ 
cision  . 

Because  the  amount  of  data  in  computerized  dictionaries  -  on  thB  long  linB  of  develop¬ 
ment  of  riT  and  CAT  systems  -  is  the  decisive  component,  the  update  of  the  (electronic! 
dictionary  plays  a  substantial  part  in  both  alternatives: 

o  One  concept  may  be  -  especially  if  the  system  still  has  a  large  capacity  of  dic¬ 
tionary  entries  -  that  the  user,  e.g.  the  translator  should  not  takB  part  at  such  a 

process  (see,  far  example  the  SYSTHAN-concept 3 ,  so  that  there  are  specialists  need¬ 
ed  on  the  system  administration  side  to  do  the  Job  of  improving  the  dictionary  data 
base ; 

o  the  other  way  is  to  let  it  to  the  professional  user  to  complete  thB  system’s  dic¬ 
tionary  or  to  add  a  special  vocabulary  (sbb,  as  an  BxamplB,  the  LOGOS-concept 3 . 

There  are  other  system  aspects  which  play  an  important  part  on  the  decision  process: 
the  availability  of  language  pairs,  thB  possibility  of  using  Dr  handling  special  text 
types  (e.g.  minutes,  letters,  ...3. 


Criteria  of  evaluation 
Quality 

Even  if  one  normally  can  Cor  has  to!  handle  the  system  as  a  "black  box",  there  are  dif¬ 
ferences  in  the  quality  of  the  "pure"  machine  (raw!  translation  results  (output!.  It  is 
not  very  easy  to  give  a  precise  measure,  but  there  are  some  important  criteria  to  be 
noted  (see,  for  details,  esp.  the  concept  of  Uan  SLYPE  (1SB23  and  the  descriptions  of 
the  two  SYSTRAN  evaluations,  Uan  SLYPE  (1S7S33. 

The  main  criteria  under  these  aspects  are: 

(13  reliability  and  fidelity,  that  means:  to  what  degree  (bad,  good  enough,  good!  the 
content  /  meaning  of  the  original  text  is  conserved. 

<23  understandabllltu  and  lntalleolbllltu .  that  means.-  to  what  degree  the  end  user  is 
able  to  read  and  understand  the  translated  text. 

This  looks  quite  simple,  but  the  problems  occur  in  the  detail:  So,  on  the  syntax  and 
stylistic  level,  a  system  normally  will  produce  translations  which  are  worse  than  human 
translations,  whereas  on  the  lexical  level,  esp.  in  the  identification  of  the  right 
technical  termfs!,  a  system’s  translation  may  even  be  more  precise  and  consistent  then 
a  human  translation. 


Application  environment 

A  fIT  or  CAT  has  to  be  seen  in  the  (technical!  environment  of  application  possibilities. 
So,  normally  a  decision  is  not  only  quality  oriented,  but  also  based  on  the  possibili¬ 
ties  of  integration  into  a  complex  text  or  word  processing  system.  Under  these  aspects, 
the  following  components  have  to  be  considered- 


Integration  In  (bibliographical  or  textual?  data  bases 


Fleanwhile,  data  bases  are  -  technical lu  spoken  -  world-wide  acceealble  via  packet 
switching  networks  and  even  satelita  communication .  So,  overcoming  the  language  barri¬ 
ers,  a.g.  between  English  and  Japanese,  but  especially  in  the  European  multilingual 
market,  becomes  an  important  desire.  Experimental  eFForts  to  integrate  FIT  systems  in 
such  an  information  process  are  made  in  Japan:  as  one  example  the  INSPEC  data  bass 
which  is  originally  English  can  be  accessed  with  Japanese  key  words.  The  key  words  and 
later  the  (English?  title  are  autoaiatically  translated  during  the  dialogue  into  Japa¬ 
nese  (Nagao  et  al .  19023.  In  a  similar  way,  a  FIT  system  For  Berman  to  English  is  used 
via  a  batch  process  and  with  post-editing?  to  translate  the  titles  of  Berman  data  bases 
(Zimmermann  at  al  13871. 

It  is  quite  clear  that  translation  of  titles  and  abstracts  could  also  be  done  by  Fiuman 
translators.  But  there  are  some  arguments  For  FIT  and  CAT:  the  text  to  be  translated  la 
machine  readable,  so  there  is  an  ideal  base  For  using  a  computer,  the  Fields  or  areas 
on  which  the  title  /  text  is  oriented  are  normally  "physically"  marked,  so  that  the 
classification  or  even  thesaurus  Functions  can  be  used  especially  For  the  lexical 
transfer  (disambiguation? ,  tFia  technical  vocabulary  needs  to  be  very  precise,  so  that 
the  computer  helps  on  basing  consistent. 

AytQmgjUe  IMlSlng 

Indexing  of  (full?  text  might  be  a  good  side-effect  oF  using  FIT  and  CAT.  For  lexical 
transfer,  one  needs  to  derive  word  forms  to  basic  forms;  compounds  and  complex  words 
have  to  be  identified  as  such,  word  class  information,  even  relations  between  terms  are 
used  for  disambiguation  purposes.  So  it  can  be  considered  to  provide  such  output  or  in¬ 
termediate  results  of  FIT  systems  For  the  purpose  of  document  archiving  and  information 
retrieval . 

Text  and  word  processing 

There  la  no  doubt  that  text  proceaaing  plays  an  important  part  in  every  translation  en¬ 
vironment.  Even  free-lance  (human?  translators  mare  and  more  will  use  a  word  processing 
system  (on  PC?,  and  there  is  a  small  step  to  integrate,  in  any  way,  (private?  glossa¬ 
ries  or  word  lists  aoceeaibla  via  so-called  "wlndowe"  on  the  screen  -  ineteed  of  using 
card-index  boxes.  It  is  quite  clear  that  otFier  functions,  e.g.  spelling,  grammar  and 
style  checkers,  will  increasingly  be  integrated  in  such  a  process. 

As  a  result,  the  "source  text"  is  machine  readable.  But  tFie  FIT  and  CAT  system  has  to  be 
adapted  to  the  (different?  word  processing  systems  (For  example  the  klang  BIS  is  com¬ 
bined  with  LOBBS  and  SYSTRAN,  WordPerfect  is  combined  with  SYSTRAN?.  IF  euch  tools  are 
available,  post-editing  of  FIT  results  can  be  supported  by  the  special  editor. 

One  problem  in  this  environment  is  the  combinotion  of  those  tools  with  local  facilities 
of  FIT  (see,  e.g.,  LOBBS?  or  the  connection  with  a  Translation  Service  Center  (see, 
e.g.,  the  concept  of  SYSTRAN  application  in  the  European  Commission  or  even  the  use  of 
the  FIINITEL-System  in  France  to  get  machine  translations  (by  SYSTRAN? . 

There  is  no  doubt  that  the  usage  of  FIT  and  CAT  will  make  progress  mainly  in  combination 
with  text  processing  and  online  access.  The  question  is,  for  the  moment,  if  the  ex¬ 
isting  tools  are  powerful  enough  (in  quality?  that  tFiey  will  be  accepted  by  the  user. 
Especially  the  results  of  the  FIT  experiment  done  by  Bachot  S.A.  on  F1INITEL  and  on  PC- 
Access  (with  SYSTRAN?  will  be  an  important  contribution  in  this  direction. 


Electronic  Publishing 

Flore  and  more,  texts  to  be  translated  (asp.  technical  text  like  repair  instructions, 
manuals?  are  fully  prepared  -  including  Figures,  drawings,  tables,  pictures  -  via  elec¬ 
tronic  publishing,  nowadays  also  known  under  the  variant  of  desk  top  publishing.  Compa¬ 
nies  giving  commissions  to  translators  -  inside  Dr  outside  the  Firm  -  don’t  want  to 
rearrange  (or  compose?  the  complete  product  of  the  translation  for  even  a  lot  of  target 
languages . 

So  a  great  effort  has  to  be  undertaken  (may  be,  on  both  sides:  the  producer  of  elec¬ 
tronic  publishing  systems  and  the  producer  of  FIT  and  /  or  CAT  software  tools?  to  inte¬ 
grate  translation  helps  without  deleting  or  violate  the  document  structure.  BF  course, 
there  are  problrms  in  line,  paragraph  or  page  adjustment  (due  to  the  different  length 
of  translated  text?,  and  also  the  rearrangement  of  phrases  /  words  due  to  tFie  different 
word  order  raises  problems  on  the  correct  integration  of  typographical  markers  (bold 
face,  underlining  etc.?  in  the  target  language.  If  these  "tecFmical”  components  of 
translation  processes  will  not  to  be  let  to  the  human  translator  /  posteditor,  a  higFier 
standardized  level  text  description  must  be  integrated  in  the  electronic  publishing  fa¬ 
cilities.  TMs  will  be  c  great  challenge  to  the  existing  and  coming  FIT  and  CAT  systems. 

User  friendliness 

Bn  behalf  of  the  professional  translator’s  work  station,  the  user  friendliness  plays  an 
important  part  on  every  "level"  of  FIT  and  CAT.  Nobody  should  want  that  in  the  use  of  FIT 
or  CAT  tire  human  actor  has  to  play  the  part  of  a  "slave",  e.g.,  correcting,  day  by  day, 
only  the  "trivial"  mistakes  of  tFie  system’s  output.  There  is,  at  the  moment,  such  a 
danger,  because  the  existing  systems  are  not  vary  flexible  and  adaptive. 


Future  developments  in  MT  and  CAT  must  therefore  concentrate  on  activities  which  give 
the  user  more  and  direct  feedback  possibilities.  The  "private  file  concept"  which  some¬ 
times  is  used  in  data  base  systems,  where  a  user  can  select  and  create  a  "personal 
part"  in  a  data  base,  could  be  an  example:  at  least  on  the  dictionary  level  the  user 
should  yet  functions  to  realize  -  on  the  basis  of  the  existiny  data  -  his  "own"  dictio¬ 
naries  (physically  or  logically). 

By  the  way,  both  sides  -  the  provider  of  the  data  base  system  and  tha  user,  can  maka 
profit  of  such  a  concept:  the  system’s  specialized  vocabulary  will  be  ameliorated  and 
the  user  will  have  a  great  (but  also  responsible)  influence  in  tha  choice  of  tha  trans¬ 
lations  . 

What  is  true  for  tha  lexical  part,  should  also  be  applicable  to  structural  componenta 
(a.g.  to  influence  the  length  of  sentences,  some  stylistic  components  etc).  Existing 
systems  should  get  more  flexibility  and  coming  systems  should  consider  such  components 
from  tha  beginning. 

Coat  and  benefit 

As  everywhere  in  business,  the  decision  of  using  a  tool  like  RT  or  CAT  is  made  on  a 
cost  and  benefit  calculation.  In  this  case,  it  is  not  only  a  pure  monetary  problem,  be¬ 
cause  "time  is  money”  and  getting  translation  without  delay  might  be  paid  on  a  higher 
price . 

But  at  the  end,  the  decision  is  done  on  an  economic  base,  having  in  mind  tha  aocial  or 
human  affects . 

Becausa  there  ia  no  aubatantial  data  available,  at  least  to  tha  author,  on  the  coet  of 
development  of  RT  and  CAT  systems,  one  has  to  concentrate  at  the  cost  and  benefit  on 
the  user’s  side. 

It  seems,  at  the  moment,  that  the  amount  of  translations  (measured  in  pages  /  day) 
which  are  produced  by  a  human  translator  via  interaction  and  /  or  post-editing  can  be 
substantially  increased.  On  assuming  that,  for  example,  the  maintenance  and  technical 
application  of  a  system  like  SYSTRAN  will  cost  S300 . 000/year ,  and  that  300.000  pages 
can  be  technically  translated  by  the  system,  the  cost  of  the  pure  translation  -  not  in¬ 
cluding  the  update  of  the  dictionaries  and  the  preparing  or  post-editing  of  the  RT-re- 
sults  -  nearly  can  be  neglected  (SI  /  page).  Tha  cost  of  the  complete  process  (transla¬ 
tion  with  man-machine  interacting)  differ  depending  on  the  quality  one  wants  to  get. 
For  a  "good-enough"  translation  -  which  means  informative  translation,  say,  for  exam¬ 
ple,  far  working  papers,  a  so-called  rapid  post-editing  can  be  done,  so  that  a  human 
translator  produces  about  BO  pages  /  day  (instead  of  4-6  pages  without  any  RT) . 

To  reach  high  quality  comparable  to  professional  human  translations,  one  has  to  consid¬ 
er  more  time  for  revising  or  post-editing.  But  it  seems,  that  the  break-even  point  is 
reached  in  the  sense  that  if  the  vocabulary  of  a  RT  system  is  adapted  to  the  user’s 
field,  the  cost  of  translation  are  less  than  the  cost  of  pure  human  translation,  even 
considering  that  the  use  of  word  or  text  processing  systems  in  the  translation  process 
spares  about  20  %  of  time. 


Basic  linguistic  an  strategic  problems 

The  morphological  component  (that  means  problems  of  inflection,  derivation,  and  decom¬ 
position)  in  RT  has  been  successfully  solved,  at  least  for  practical  purposes  in  the 
application  environment,  even  if  the  correct  translation  of  identified  derived  and  de¬ 
composed  words  not  always  is  reached  via  automated  rules.  This  is  not  the  case  for  so¬ 
lutions  on  the  syntactic  or  semantic  level.  Even  if  we  assume  that  a  problem,  e.g.  the 
disambiguation  of  syntactic  homographs  (like  RAINS  in  IT  RAINS  or  THE  RAINS)  can  be 
solved  via  a  strict  and  fully  formalized  parsing  system,  the  complexity  of  natural  lan¬ 
guage  structure  leads  to  an  explosion  of  computer  time  if  one  tries  to  integrate  or 
handle  every  possible  (partial)  structure  or  occurence.  So,  in  reality,  commercially 
available  systems  try  to  shorten  the  process  of  identification  (or  disambiguation)  viB 
special  deterministic  rules  or  probabilities.  As  a  result,  they  run  10000  or  1000  times 
faster  than  a  fully  linguistic  oriented  systBm,  but  their  results  may  not  reach  the 
same  quality. 

Today,  computer  time  plays  not  any  more  the  same  role  as  so’ms  years  ago,  but  in  machine 
translation  computer  time  up  to  now  is  not  fully  neglectable.  The  same  is  truB  for  the 
solution  of  problems  of  homonymy,  that  means  in  the  semantic  field.  On  the  one  side, 
there  are  limits  in  the  handling  of  text  structure  vs.  sentence  structure.  In  most 
cases,  the  knowledge  base  of  a  RT  system  is  the  sentence  environment,  that  means  that 
information  or  solutions  of  previous  sentences  ere  lost  and  that  nearly  no  data  is 
known  at  the  text  level.  This  leads  to  many  problems,  especially  in  the  field  of  pro¬ 
nominal  reference,  but  also  in  article  insertion  and  homonym  disambiguation.  The  way 
existing  systems  are  handling  the  general  problem  of  semantic  ambiguities  is  by  intro¬ 
ducing  semantic  codes  (which  -  on  a  general  level  -  also  play  some  role  in  disambigu¬ 
ation  of  syntactic  structures),  especially  "field”  or  discipline  markers,  which  are 
used  to  select  the  "right”  word  (or  even  word  sequence)  depending  on  field  parameters 
given  by  the  user.  They  also  try  to  solve  this  problem  by  dictionary  look  up  to  identi¬ 
fy  word  sequences  or  even  sayings  (which  normally  have  to  be  lexlcalized  because  there 
exists  no  algorithmic  solution,  see  the  Berman  ES  REBNET  BINDFADEN,  which  has  to  be 
translated  by  IT  RAINS  CATS  AND  DOGS. 


Since  Chomsky ,  the  systematic  structural -semantic  access  to  language  analysis,  “under¬ 
standing*  end  translation  has  made  soma  progress .  So,  on  the  research  level,  there  are 
several  modern  Formalized  grammar  types  and  parsing  systems  available.  Especially  in 
Japan  (sea,  a.g.,  the  MU-System)  and  Europe  (e.g.  the  efforts  made  by  the  European  Com¬ 
munity  end  their  member  states  with  the  European  Translation  System,  EUROTRA)  research 
in  ITT  ia  continuing.  But  it  seems  that  one  needs  more  than  computational  linguistic  de¬ 
velopment:  linguists,  computer  specialists,  information  scientists  and  users  have  to 
cooperate  in  larga-acale  projects  to  reach  the  aim  of  practical  usability. 


Examples 

To  give  some  impressions  on  the  state  of  the  art  of  so-called  productive  (not  to  say 

commercial)  systems,  and  alao  to  show  how  tha  mentioned  criteria  can  be  applied,  two 

systems,  the  FIT  systems  SYSTRAN  and  SUSY /STS  will  be  described. 

SYSTRAN 

SYSTRAN  (commercial  rights  at  Sachet  S.A.,  France)  -  in  its  newest  version  3.7  -  has 

the  following  characteristics: 

-  Translation  of  full  text.  Evan  if  the  atructurs  is  not  correct  or  words  are  mis¬ 
spelled  or  words  are  not  found  in  the  machine  dictionary,  a  translation  is  done. 

-  The  speed  of  translation  (depending  on  computer  capacity)  reaches  up  to  350.000  words 
per  hour.  So  it  is  the  fastest  ayatam  available  on  the  market. 

-  The  system  is  language-pair  oriented.  Translations  are  available  for  the  pairs  En¬ 
glish  ->  French,  English  ->  Italian,  French  ->  English,  Russian  ->  English  (USAF), 
English  ->  Japanese  (SYSTRAN  JAPAN),  English  ->  Arabic,  under  development  are,  be¬ 
sides  others,  English  ->  Barman,  French  ->  German,  Berman  ->  English  and  Berman  -> 
French.  The  quality  depends,  an  the  one  side,  on  tha  availability  of  (discipline  ori¬ 
ented)  dictionaries.  A  great  effort  has  bean  made  at  the  European  Community  to  devel¬ 
op  the  SYSTRAN  dictionaries.  For  tha  translation  from  English  to  French,  the  diction¬ 
ary  now  contains  more  than  150,000  entries.  The  same  quality  is  not  reached,  e.g., 
for  translation  from  Barman  to  French,  which  is  in  the  starting  phase,  even  on  the 
level  of  language  analysis. 

-  SYSTRAN  application  needs  technical  specialists  (and  administration).  So  only  compa¬ 
nies  which  are  able  to  raalize  a  special  staff  (e.g.  the  EC  or  USAF)  have  tha  possi¬ 
bility  to  use  a  SYSTRAN  version  on  their  own  computer  (if  the  computer  is  a  main¬ 
frame  IBM  or  IBM  compatible).  But  there  is  an  interesting  alternative:  to  use  the 
system  via  telecommunication  networks,  e.g.  packet  switching  or  -  as  a  very  'futuris¬ 
tic*  example  -  vie  videotex.  In  France,  a  videotex  application  (using  the  French  vid¬ 
eotex  version,  called  TELETEL,  via  a  telephone  combined  with  a  screen,  called  MINI- 
TEL)  is  already  available  (and  even  used  by  pupils). 

-  Dictionary  maintenance  -  up  to  now  -  for  SYSTRAN  dictionaries  normally  has  to  be  done 
by  system  experts.  The  main  problem  is  not  the  coding  Itself  (which  is  very  complex, 
but  could  end  will  be  handled  by  uset — friendly  interfaces),  but  the  consistency  of 
the  dictionary  data  base.  Dictionaries  contain  area  codes,  but  this  component  has  to 
be  developed  to  get  a  more  flexible  user  and  usage  orientation. 

-  SYSTRAN  is  nearly  not  portable,  that  means:  the  basic  cods  is  an  IBM-Assembler ,  even 
if  the  linguistic  rules  normally  are  written  in  a  special  macro  language.  The  system 
itself  needs  -  as  mentioned  above  -  a  mainframe  (IBM  or  SIEMENS  or  AMDAHL)  computer 
or  computers  of  similar  size.  A  software  revision  (may  be,  on  tha  base  of  UNIX)  is 
planned . 

-  Tha  quality  of  (raw)  translation  differs  depending  on  the  language  pairs  and  diction¬ 
ary  satisfaction:  If  one  concentrates  on  English  ->  French,  the  fallowing  percentages 
may  apply:  Morphological  identification:  about  100  syntactic  structures.-  about  SO 

semantic  disambiguation:  between  B0  and  SO  *,  depending  on  the  integration  of  so- 
called  limited -semantic  rules. 

-  SYSTRAN  can  be  combined  with  several  word  processing  environments.  One  is  the  Wang- 
0IS  (which  is  used  at  the  EC),  but  one  can  also  use  PC  (IBM  compatible)  with  an  edi¬ 
tor  like  WordPerfect.  There  ere  applications  where  one  uses  an  optical  character 
reader  to  create  a  text  file  with  WordPerfect,  sending  the  data  to  the  SYSTRAN  ser¬ 
vice  center  (e.g.  at  Sechot  S.A.,  Paris)  and  getting  back  the  translated  text  vie 
post  line  and  a  software  tool  which  allows  to  post-edit  the  text  having  both  versions 
-  the  original  and  the  translation  -  on  a  splitted  screen. 

-  The  usage  or  the  translation  oystem  itself  is  batch  oriented.  During  the  process  of 
translation,  there  is  no  possibility  of  intervention  (despite  the  fact  that  the  sys- 
tem  administrator  has  the  possibility  to  Introduce  translations  for  uc..’-nown  words). 
On  the  other  side,  the  user  handles  the  system  as  a  black  box,  he  doesn’t  need  any 
knowledge  of  the  system. 

-  There  is  no  information  available  about  the  development  coat  of  the  system.  One  oen 
estimate  that  they  are  between  S  SO, 000,000  and  S  50,000,000.  The  EC  application 
■lone  cost  about  B  4,000,000  and  S  5,000,000.  The  eost  running  a  version  (without 
royalties)  lie  -  in  my  opinion  -  at  S  300,000  /  year.  So  one  can  estimate  that  if  one 
is  able  to  translate  about  300,000  pages  a  year  (e.g.  at  the  EC),  that  the  system 
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cost  ira  lass  than  9  1/  pass  (word  processing,  post-editing  and  computer  tints  rot 
included).  The  translation  fas  on  ths  French  fUNITEL  (videotex)  systems  is  about  9 
0.10  par  ’windm.-’  (up  to  10  lines),  the  automatic  (raw)  translation  normally  is 
available  in  this  application  within  45  to  E0  seconds. 

-  This  leads  to  ths  conclusion  that  SYSTRAN  is  end-user  oriented,  that  moans  that  its 
market  is  ths  ’information  society’  whera  100  \  quality  is  not  needed,  but  flood- 
enough  translation  to  understand  a  text  written  in  a  language  not  or  nearly  not  known 
by  the  user .  But  there  is  no  doubt  that  SYSTRAN  has  also  a  good  chance  to  be  used  in 
the  professional  translation  environment  as  a  high  spaed  tool  and  a  possible  alterna¬ 
tive  in  computer  aided  translation. 

SUSY /STS 

SUSY/STS,  developed  in  tha  framework  of  a  large  research  project  in  computational  lin¬ 
guistics  at  the  university  of  Saarbrucken  (FRS),  now  is  used  as  production  system  in 

the  field  of  translation  of  date  basae.  This  service  is  done  by  the  Institute  of  ap¬ 
plied  Information  Research  (IAI)  at  Saarbrucken.  The  main  characteristics  are: 

-  Translation  of  Full  texts  (sentence  level)  is  possible.  In  the  application  environ¬ 
ment,  one  concentrates  on  title  (and  abstract)  translation. 

-  As  SYSTRAN,  SUSY/STS  is  an  “all-round”  and  robust  translation  system:  A  text  is 
translated  even  if  there  are  unknown  words  included.  In  such  a  case,  the  original 
word  is  integrated  in  ths  text  of  the  target  language. 

-  To  soma  extent,  SUSY/STS  is  a  ’multilingual”  MT  system.  Only  tha  transfer  component 
(between  two  languages)  is  bilingual,  whereas  tha  analysis  and  synthesis  ie  done  in¬ 
dependent  from  tha  target  reap,  source  language.  SUSY/STS  applications  ere  poaeibla 
for  the  language  directions  Barman  — >  English,  English  ->  Berman  and  Russian  — >  Ber 
man,  soma  effort  has  also  bean  made  for  tha  development  of  French  ->  Berman  and  Espe¬ 
ranto  ->  Berman.  Tha  main  application  is  made  in  Barman  ->  English,  where  a  basic 
Berman  analysis  dictionary  of  about  150,000  entries,  a  Berman  compound  dictionary  of 
about  140,000  entries,  a  Berman  ->  English  dictionary  of  about  80,000  entries  is 
available . 

-  As  SYSTRAN,  SUSY/STS  needs  an  administrative  staff  to  handle  the  application.  There¬ 
fore,  ana  concentrates  on  a  translation  service  concept.  Tranelations  are  proceeeed 
only  at  the  computer  center  of  the  IAI,  Saarbrucken.  Tha  clients  have  to  send  their 
data  via  magnetic  tape  to  the  IAI,  they  get  back  a  magnetic  tape  including  the  trans¬ 
lations.  Tha  main  applications  are,  as  mentioned,  the  translation  of  Berman  titles 
coming  from  different  data  bases  to  English.  Besides,  the  bilingual  terminology  in 
the  different  disciplines  is  updated,  the  dictionaries  are  augmented  with  the  special 
terminology  during  the  processing  of  the  client’s  data.  Clients,  besides  others,  are: 
The  Information  Center  for  Building  Construction,  where  a  bilingual  data  base  (ICON- 
DA)  is  produced;  the  Barman  Patent  Office,  where  -  at  tha  moment  -  the  Berman  Catch¬ 
word  Index  la  translated  to  English;  the  Bermen  Institute  of  Standards,  where  the  ti¬ 
tles  of  Barman  industrial  standards  are  translated  to  English  to  be  integrated  in  the 
corresponding  data  base;  the  Information  Center  for  Social  Science,  where  ths  titles 
of  a  bibliographical  data  base  are  translated.  Up  to  now,  more  than  100,000  titles 
have  been  translated  by  this  service. 

-  The  System  is  available  under  UNIX  (and  in  this  sense,  porteble),  the  programming 
language  is  FORTRAN,  with  some  basic  parts  written  in  ”C”.  A  version  is  also  avail¬ 
able  on  SIEMENS  mainframe  computer. 

-  The  quality  of  the  (raw)  machine  translation  (Berman  ->  English)  is  satisfying:  about 
93  \  identification  of  words  in  the  source  language  on  morphological  level;  about  90 
X  correct  identifications  on  the  syntactic  level.  There  are  no  statistics  available 
about  the  correctness  of  semantic  disambiguation,  but  development  of  tools  for  inte¬ 
grating  discipline  codes  is  under  progress. 

-  The  service  normally  Integrates  the  post-editing  to  reach  a  high  quality  of  transla¬ 
tion.  As  just  mentioned,  the  client  sends  its  data  via  magnetic  tape  to  the  service 
center.  As  a  first  step,  a  spelling  check  is  done  to  identify  and  correct  misspelled 
words.  Then,  the  unknown  words  (mainly  in  the  transfer  dictionary  of  the  MT  system) 
are  identified,  the  dictionary  is  then  updated  (by  the  human  translator).  In  the  next 
step,  the  automatic  raw  translation  is  done  and  the  text  is  post-edited  (on  screen 
using  a  PC  or  terminal)  by  the  human  translator.  After  a  controlling  phase,  the  mag¬ 
netic  tape  is  sent  back  to  the  client.  It  is  possible  to  integrate  an  indexing  sys¬ 
tem,  that  means  to  use  the  analysis  phasa  of  SUSY/STS  to  produce  lemmatized  words 
(instead  of  word  forms)  and  decompose  or  derive  complex  words  to  get  relations  be¬ 
tween  compounds  and  the  single  words  from  which  they  are  derived. 

-  As  SYSTRAN,  SUSY/STS  merely  is  batch-oriented.  But  a  special  trained  user  would  be 
able  to  integrate  new  words  into  the  different  dictionaries:  the  dictionary  mainte¬ 
nance  is  dialogue-oriented.  Post  edition  can  be  done  ssperately  on  PC  or  via  terminal 
on  the  host  system  (i.e.  the  mainframe  computer,  which,  at  the  service  center,  is  a 
Nixdorf  TARB0N  35). 

-  The  development  cost  of  the  system  were  about  9  5,000,000  (basic  research  at  the  uni¬ 
versity  not  included).  The  adaptation  tv  the  STS  service  environment  cost  about  9 
500,000.  The  basic  cost  keeping  tha  system  available  and  running  are  about  9  150,000 
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/  year ,  the  service  personal  (not  the  translators)  included.  The  service  is  cost-cov¬ 
ering,  if  about  tOO, 000  titles  or  sentences  can  be  translated  (fas  /  title:  8  DPI  - 
about  %  0.80. 

-  As  a  conclusion,  one  can  say  that  SUSV/STS  is  a  specialized  CAT  system.  It’s  speed 
(5,000  running  words  /  hour,  CPU  time,  can  be  translated)  is  not  comparable  with  SYS¬ 
TRAN,  and  there  will  not  be  an  augmentation  of  the  language  pairs.  But  it  can  be 
shown  that,  under  consideration  of  the  possibilities  and  by  adaptation  to  special 
needs,  ITT  systems  will  play  an  interesting  role  in  the  future. 

Under  the  limits  of  this  article,  it  is  not  passible  to  give  similar  descriptions  of 
other  existing  (and  available)  systems.  It  also  needs  a  good  familiarity  in  the  use  and 
a  good  knowledge  of  the  conceptual  level .  The  aim  was  to  give  some  concrete  examples  to 
give  a  first  impression  of  the  complexity  of  this  theme.  So  it  seems  to  be  to  simple  to 
handle  FIT  from  the  pure  ’’linguistic*  view  (in  the  sense,  that  the  quality  of  a  FIT  is  or 
is  not  comparable  with  the  quality  of  human  translation)  or  -  on  the  other  side  -  to 
handle  FIT  as  a  pure  technical  software  tool:  Progress  can  only  be  made  if  one  considers 
the  limits  and  the  possibilities  of  FIT  in  concrete  application  environments. 


5^  Outlook 

If  one  looks  in  a  distant  future,  it  seems  quite  clear  that  professional  translation 
will  normally  be  undertaken  by  using  FIT  systems.  It  depends  more  or  less  on  the  avail¬ 
ability  of  the  "right"  language  pairs,  the  completeness  of  machine  vocabularies,  the 
availability  of  the  "right"  technical  environment  and,  last  not  least,  on  the  cost  of 
the  utilization  of  the  system. 

In-between  (and  besides),  CAT  systems  on  the  level  of  vocabulary  and  terminology  aid  - 
integrated  in  or  added  to  text  processing  systems,  will  play  a  substantial  part,  esp. 
as  an  aid  during  tire  creation  of  a  text  in  a  foreign  language  by  the  author  or  to  un¬ 
derstand  a  written  machine  readable  text  by  the  receiver  arriving  via  telecommunication 
(Teletex).  In  the  office  environment,  CAT,  mainly  with  bilingual  dictionariee  on  CD-ROM 
or  hard  disk  will  be  used  as  a  variant  of  style  and  grammar  help. 

On  *-he  contrary  of  some  assumptions  (see,  e.g.,  Hutchins  19B6,  pp  331  -  334),  we  don’t 
see  any  need  for  a  special  translator 1  a  work  station,  but  there  will  special  functions 
as  translation  help  be  integrated  in  powerful  writer 'a  workstations.  so  that  everybody 
will  make  profit  of  the  development  in  computerized  dictionaries  and  thesaurus. 

Uhat  will  happen  with  the  human  professional  translator  in  the  environment  of  technical 
translations?  One  can  see  FIT  system  as  a  special  expert  system  with  the  problem,  that 
it  will  nearly  never  reach  -  generally  spoken  -  the  full  quality  of  good  and  special¬ 
ized  human  translators.  But  to  develop,  to  augment  and  even  to  maintain  such  transla¬ 
tion  expert  systems,  one  needs  "language  knowledge  engineers"  which  are  able  to  handle 
and  to  feed  these  systems.  On  behalf  of  the  complexity  of  natural  language  and  knowl¬ 
edge,  there  will  be  cooperation  between  the  system  and  the  translator.  And  there  is  a 
good  chance,  that  if  translation  of  texts  will  become  cheaper  and  faster,  more  demand 
will  arise,  so  that,  at  the  end,  human  translators  will  still  play  a  substantial  part  - 
under  changed  conditions. 


literature: 

Hutchins,  U.J.  (1SBE):  Machine  translation:  past,  present,  future.  Chichester. 

Nagao,  M.  at  al  (1988):  An  English-Japanasa  machine  translation  system  of  the  titles  of 

scientific  and  engineering  papers.  In:  C0LINB  1988,  Amsterdam. 

Uan  Slype,  G.  (1979):  Dsuxiime  Evaluation  du  systlme  da  traduction  automatiqus  SYSTRAN 
anglais-f rangais  de  la  Commission  dss  CommunautSs  EuropBennes .  Bruxelles. 

Uan  Slype,  B.  (1SB8):  Conception  d'une  mithodologie  gBnBrale  devaluation  de  la  traduc¬ 
tion  automatique.  Multilingua  1,  881-E37. 

Zimmermann,  H.,  Kroupa,  E.,  Luckhardt,  □.  (1987):  STS.-  Das  Sasrbrucksr  Ubersetzungs- 
systam.  Saarbrucken  19B7 . 


Aspects  linguistiques  et  techniques  de  la  traduction  automatique 


Harald  H.  Zimmermann 
University  Je  Saarlande 
5.5  Informationswissenschaft 
D-6600  Saarbriicken  1 1 
RFA 


Resume 

Pour  permettre  de  comparer  les  systemes  de  traduction  assistee  par  ordinateur  (CAT)  et  de  traduction  automatique  (MT).  les 
criteres  essentiels  et  les  caracteristiques  des  differents  concepts  sont  presentes.  Parmi  les  criteres  majeurs  de  differenciation 
figurent: 

—  T raduction  de  textes  complets  (y  compris  les  litres,  les  resumes)  par  opposition  a  une  aide  lexicale  a  la  traduction; 

—  Traduction  d’ordre  general  par  opposition  a  une  traduction  de  textes  dans  des  domaines  specialises; 

—  Traduction  bilingue  par  opposition  a  traduction  multilingue; 

—  Enrichissement  des  dictionnaires  et  des  systemes  oriente  vers  I'utilisateur; 

—  Portability. 


Parmi  les  criteres  essentiels  devaluation  sont: 

—  Qualite  de  la  traduction  brute  (aussi  appelee  informative);  Avantages/inconvenients  par  opposition  a  la  traduction 
humaine; 

—  Integration  dans  un  environnement  d'utilisateurs  (bases  de  donnees,  indexation  et  recherche,  traitement  de  textes,  edition 
electronique); 

—  Convivialite; 

—  Couts  du  developpement  et  de  ('installation; 

—  Couts  de  fonctionnement  et  effets  de  realisation. 


Enumeration  des  symboles 


CAT 

CE 

EUROTRA 

LOGOS 

MIN1TEL 

MT 

PC 

SUSY/STS 

SYSTRAN 


Wang  OIS 
WordPerfect 


Traduction  assistee  par  ordinateur 
Communaute  europeenne 

Systeme  europeen  de  traduction,  projet  de  recherche  de  la  Communaute  europeenne 
Systeme  commercialement  disponible  de  traduction  automatique  de  LOGOS  Corporation; 

Outil  d’acces  du  systeme  videotex  au  TELETEL  fran<;ais 
Traduction  automatique 
Ordinateur  personnel 

Systeme/service  de  traduction  de  Saarbriicken 

Systeme  commercialement  disponible  de  traduction  automatique  de  World  Translation  Center.  Etats-Unis  et 

de  Gachot  S.A.,  France 

Systemes  de  bureau  Wang 

Editeur  commercialement  disponible 


INTRODUCTION 


Vu  le  developpement  technique  de  l'industrie  de  l'information,  on  doit  realiser  que  I'ordinateur  sera  integre  dans  presque 
chaque  activity  humaine  professionnelle.  Dans  le  domaine  du  traitement  de  textes.  ['interaction  homme-machine,  au  niveau  du 
PC  ou  sur  la  base  d'un  poste  de  travail  sur  terminal  au  lieu  d'une  machine  a  ecrire,  sera  la  norme  dans  un  proche  avenir,  a  savoir 
d'ici  a  la  fin  des  annees  80. 

D’autre  part,  le  probleme  de  la  traduction  automatique  de  texte  naturel  en  general  —  sans  meme  parler  de  la  comprehension  de 
la  parole  et  de  la  traduction  de  la  parole  —  ne  sera  pas  resolu  dans  le  sens  de  la  HQFT.  (traduction  entie.  •  ment  automatique  de 
haute  qualite),  en  raison  de  la  complexity  des  langages  naturels.  II  y  a  done  des  limites  (plus  ou  moins  dans  l'approche 
scientifique)  et  des  possibilites  d’integration  des  outils  techniques  dans  le  cours  de  la  traduction  (humaine)  ou  meme  de  la 
comprehension  du  texte  par  la  machine. 

Le  but  de  cette  contribution  est  d’expiorer  les  possibilites  de  I'utilisation  de  I'ordinateur  dans  le  domaine  de  '  >  traduction 
automatique  et  de  la  traduction  assistee  par  ordinateur,  Meme  s’il  serait  interessant  de  traiter  egalement  les  principaux 
problemes  de  traduisibilite  des  textes  (et  de  la  comprehension),  nous  nous  concentrerons  sur  la  traduction  technique,  e’est-a- 
diie  la  traduction  de  textes  techniques  ou  communs. 

Aspects  systymatiques 

Meme  s’il  pourrait  etre  interessant  de  savoir  comment  la  traduction  automatique  est  realisee,  les  aspects  linguistiques  (en 
particulier  les  modeies  de  la  grammaire  et  des  strategies  possibles  ou  employees)  ont  un  role  secondaire.  Aussi  on  peut 


supposer  qu'il  y  a  unc  sortc  de  “boite  noire",  ou  un  texte  ou  des  mots  dans  un  langage  nature!  sont  inseres  et.  uvcc  ou  sans 
interaction  humaine,  une  ou  plusieurs  possibilities  de  traductions.  La  traduction  pourrait  ctrc  bonne,  utile  ou  mauvaise  scion  les 
avis  des  utilisateurs. 


En  se  concentrant  sur  I'utilisation  et  I'utilite  de  la  traduction  automatique  ou  de  la  traduction  assistee  par  ordinateur,  on  doit 
en  principc  —  faire  la  distinction  entre  deux  groupes  d'utilisateurs  principaux: 

( 1 )  I'utilisateur  dit  final,  e'est-a-dire  un  specialiste  qui  essaie  de  savoir  de  quoi  parlc  un  article  ecrit  en  langage  naturel  plus  ou 
moins  ineonnu  de  lui;  une  personne  ecrivant  une  lettre  a  un  ami  en  langue  etrangere...  et 

(2)  le  professionnel,  en  particulier  un  traducteur  humain  qui  essaye  d'utiliser  la  machine  comme  un  outil  dans 
I’accomplissement  de  son  travail. 

Vu  sous  cet  angle,  on  peut  laisser  de  cote  les  projets  et  la  recherche  fondamentale  sur  les  principes  de  la  traduction  automatique 
/  comprehension  des  langues  et  s'interesser  plus  particulierement  aux  outils.  systemes  pratiques. 

Comme  point  de  depart,  il  est  important  de  faire  la  distinction  entre  deux  strategies  principales:  traduction  automatique  (MT) 
et  traduction  assistee  par  ordinateur  (CAT). 

—  Un  systeme  s'appellera  un  systeme  de  traduction  automatique  (MT)  seulement  si  le  processus  de  traduction  —  en  partant 
du  texte  source  lisible  sur  ordinateur  —  est  accompli  sans  aucune  interaction  humaine  pour  atteindre  une  qualite  de  texte  cible 
au  moins  “assez  bonne"  pour  I'information. 

—  Un  systeme  s'appellera  un  systeme  de  traduction  assistee  par  ordinateur  (CAT)  si  une  interaction  humaine  est  necessaire 
ou  prevue  pour  atteindre  le  but  d’une  "bonne"  traduction  d'un  texte  source  (lisible  sur  ordinateur  ou  pas). 

II  est  bien  evident  qu'un  systeme  de  traduction  automatique  peut  venir  en  complement  d'un  systeme  de  traduction  assistee  par 
ordinateur:  un  texte  peut  etre  ameliore  avant  de  debuter  la  traduction  automatique  de  fa?on  a  obtenir  une  meilleure  traduction 
automatique  ("pre-edition"),  et/ou  un  texte  traduit  par  ordinateur  peut  etre  post-edite  par  un  traducteur  humain  pour  obtenir 
une  meilleure  qualite. 

II  existe  sur  le  marche  de  nombreux  systemes  de  traduction  automatique  et  de  traduction  assistee  par  ordinateur  qui  pretendent 
etre  les  veritables  outils  pratiques,  et  le  choix  est  difficile  a  faire  sans  avoir  de  criteres  precis  pour  en  decider. 

Puisque  la  quantite  de  donnees  dans  les  dictionnaires  automatises  —  sur  la  longue  periode  de  developpement  des  systemes  MT 
et  CAT  —  est  le  composant  decisif,  la  mise  a  jour  du  dictionnaire  (electronique)  joue  un  role  substantiel  dans  les  deux  options 
suivantes: 

—  Un  concept  peut  etre  —  en  particulier  si  le  systeme  a  encore  une  grande  capacite  d'entrees  dans  le  dictionnaire  —  que 
I'utilisateur,  par  exempk  le  traducteur  ne  devrait  pas  participer  a  un  tel  processus  (voir,  par  exemple  le  concept  Systran),  de 
sortc  qu'il  y  ait  des  specialistes  requis  du  cote  de  ('administration  de  systeme  pour  ameliorer  la  base  de  donnees  du  dictionnaire; 

—  L'autre  maniere  est  de  laisser  le  soin  a  I'utilisateur  professionnel  de  completer  le  dictionnaire  du  systeme  ou  y  ajouter  un 
vocabulaire  special  (voir,  par  exemple,  le  concept  Logo). 

II  y  a  d'autres  aspects  du  systeme  qui  jouent  un  role  important  dans  le  processus  de  decision:  la  disponibilitc  des  couples  de 
langues,  la  possibility  d'utiliser  ou  de  traiter  des  types  de  texte  speciaux  (par  exemple.  minutes,  lettres,...). 

Criteres  (('evaluation 

Qualite 

Meme  si  on  peut  (ou  doit)  normalcment  traiter  le  systeme  comme  une  "boite  noire”,  il  existe  des  differences  dans  la  qualite  des 
resultats  “bruts"  de  traduction  automatique  (sortie).  Il  n'est  pas  tres  facile  de  donner  une  mesure  precise,  mais  il  y  a  certains 
criteres  importants  a  noter  (voir,  pour  les  details,  en  particulier  le  concept  de  Van  SLYPE  (1982)et  les  descriptions  des  deux 
evaluations  SYSTRAN.  Van  SLYPE  ( 1 979). 

Les  criteres  principaux  a  considerer  sont: 

(1)  fiabilite  et  fidclite.  ce  qui  signifte:  a  quel  degre  (mauvais.  assez  bon,  bon)  la  teneur/signification  du  texte  original  est 
conservee. 

(2)  comprehensibilite  et  intellegibilite,  ce  qui  signifte:  a  quel  degre  I'utilisateur  final  peut  lire  et  comprendre  le  texte  traduit. 

Ccci  parait  assez  simple,  mais  les  problemes  se  produisent  dans  le  detail;  Ainsi,  au  niveau  syntaxique  et  stylistique,  un  systeme 
produira  normalement  des  traductions  qui  sont  pires  que  les  traductions  humaines,  tandis  que  au  niveau  lexical, 
particulierement  dans  I'identification  du(des)  bon(s)  terme(s)  technique(s),  la  traduction  machine  peut  meme  etre  plus  precise 
et  consistante  qu'une  traduction  humaine. 
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Environnement  des  possibilites  d'application. 

Un  systeme  de  traduction  automatique  ou  de  traduction  assistee  par  ordinateur  doit  etre  considere  dans  1'environnement 
(technique)  des  possibilites  d’application.  Ainsi,  normalement  une  decision  est  non  seulement  orientee  vers  la  qualite.  mais 
egalement  basee  sur  les  possibilites  d’integration  dans  un  systeme  de  traitement  de  texte.  Vu  sous  cet  angle,  les  elements 
suivants  doivent  etre  consideres: 

Integration  dans  des  bases  de  donnees  (bibliographiques  ou  textuelles). 

Entretemps,  les  bases  de  donnees  sont  —  techniquement  parlant  —  mondialement  accessibles  par  I’intermediaire  de  reseaux  de 
commutation  et  meme  de  satellites  de  communication  Ainsi,  reduire  les  barrieres  linguistiques,  par  exemple  entre  l’anglais  et  le 
Japonais,  mais  particulierement  sur  le  mar.he  multilingue  europeen,  devient  un  vif  souhait.  Des  efforts  experimentaux  sont 
consacres  au  Japon  pour  integrer  les  systemes  de  traduction  automatique  dans  un  tel  processus  de  I'information:  par  exemple  la 
base  de  donnees  INSPEC  qui  est  originellement  anglaise  peut  etre  interrogee  avec  les  mots  cles  japonais.  Les  mots  cles  et  plus 
tard  les  titres  (anglais)  sont  automatiquement  traduits  au  cours  du  dialogue  en  japonais  (Nagao  et  autres  1982).  D  une  maniere 
semblable,  un  systeme  de  traduction  automatique  de  1’allemand  en  anglais  est  utilise  en  traitement  par  lots  avec  post-edition 
pour  traduire  les  titres  des  bases  de  donnees  allemandes  (Zimmermann  et  autres  1 987). 

II  est  bien  evident  que  la  traduction  de  titres  et  de  resumes  pourrait  egalement  etre  faite  par  des  traducteurs  humains.  Mais  il  y  a 
des  arguments  en  taveur  des  systemes  de  traduction  automatique  et  de  traduction  assistee  par  ordinateur:  le  texte  a  traduire  est 
lisible  par  ordinateur,  il  s’agit  done  d’une  base  ideate  pour  utiliser  un  ordinateur.  les  domaines  ou  les  secteurs  vers  lesquels  la 
rubrique/le  texte  est  oriente  sont  normalement  "physiquement”  marques,  de  sorte  que  la  classification  ou  meme  les  fonctions 
de  thesaurus  puissent  etre  employees,  particulierement  pour  le  transfert  lexical  (desambiguisation).  le  vocabulaire  technique 
demande  a  etre  tres  precis,  de  sorte  que  I'ordinateur  aide  en  etant  homogene. 

Indexage  automatique. 

I.'indexage  (complet)  d’un  texte  peut  etre  une  applieation  seeondaire  valable  de  ('utilisation  des  systemes  de  traduction.  En 
effet,  pour  le  transfert  lexical,  on  doit  faire  deriver  les  formes  de  mots  des  formes  de  base,  on  doit  identifier  les  mots  composes 
et  les  mots  complexes  en  tant  que  tel.  les  categories  de  mots,  voire  les  relations  entre  les  termes  aux  fins  de  desambiguisation. 
Ainsi  peut-on  envisager  qu'un  systeme  de  traduction  soit  utilise  de  fagon  definitive  ou  intermediate  pour  I’archivage  de 
documents  et  la  recherche  documentairc. 

Traitement  de  texte. 

II  n'y  a  aucun  doute  que  le  traitement  de  texte  joue  un  role  important  dans  chnque  environnement  de  traduction.  Meme  les 
traducteurs  (humains)  independants  utiliseront  de  plus  en  plus  un  systeme  de  traitement  de  texte  (sur  PC),  et  I’on  tend,  petit  a 
petit,  a  integrer.  d  une  maniere  ou  d  une  autre,  les  glossaries  ou  les  listes  (personnels)  et  de  les  rendre  accessibles  par 
I'intcrmediaire  de  "fenetres"  —  au  lieu  d'utiliser  les  fichiers  manuels.  Il  est  bien  evident  que  d'autres  fonctions.  par  exemple,  les 
controleurs  orthographiques,  seront  de  plus  en  plus  integres  dans  un  tel  processus. 

En  consequence,  le  "texte  source  "  est  assimilable  par  ordinateur.  mais  on  doit  adapter  les  systemes  de  traduction  aux  differents 
traitements  de  texte  (par  exemple  Wang  OIS  est  combine  aux  systemes  LOGOS  et  SYSTRAN,  WordPerfect  a  SYSTRAN).  Si 
de  tels  outils  sont  disponibles.  la  post-edition  de  la  traduction  brute  pourra  etre  realisee  pai  un  reviseur  confirme. 

Un  des  problenies  qui  se  pose  dans  cet  environnement  est  la  combinaisoi.  de  ces  outils  avec  les  systemess  locaux  de  MT  (par 
exemple.  LOGOS)  ou  la  connexion  a  un  serveur  de  traduction  (voir  I'applieation  de  SYSTRAN  au  sein  de  la  Commission 
europeenne  ou  dans  le  reseau  Minitel  en  France  pour  obtenir  des  traductions  automatiques. 

II  ne  fait  aucun  doute  que  ('utilisation  des  systemes  de  traduction  progressera  principalcment  en  parallele  avec  le  traitement  de 
textes  et  faeces  en  direct.  La  question  est,  pour  finstant,  de  savoir  si  les  outils  existants  sont  assez  puissants  (en  qualite)  pour 
etre  acceptes  par  futilisateur.  En  particular,  les  resultats  de  (experience  de  TAO  mcnee  par  GACHOT  S.A.  sur  Minitel  et  via 
PC  (avec  SYSTRAN)  constitueront  une  appreciation  importante  a  cet  egard. 

Edition  electronique 

De  plus  en  plus,  les  textes  a  traduire  (en  particulier,  les  textes  techniques  comme  les  instructions  de  reparation,  les  manuels) 
sont  entierement  prepares  —  y  compris  les  chiffres,  les  plans,  les  tableaux,  les  images  —  par  fintermediaire  de  (edition 
elcctronique,  que  Ton  connait  aussi  sous  le  nom  de  micro-edition.  Les  societes  qui  utilisent  les  services  de  traducteurs.  internes 
ou  externes,  ne  veulent  pas  remettie  en  forme  (ou  composer)  une  traduction  meme  dans  beaucoup  de  langues  cible. 

Aussi  un  grand  effort  doit-i!  etre  .onsacre  (et  ce,  de  la  part  du  fabricant  de  systemes  d'edition  electronique  et  du  producteur  de 
logicicls  de  traduction)  pour  integrer  les  aides  a  la  traduction  sans  supprimer  ou  violer  la  structure  du  document. 
Naturellement,  il  existe  des  problemes  d’ajustement  de  ligne.  de  paragraphe  ou  de  page  (dus  a  la  difference  de  longueur  entre 
textes  source  et  cible),  et,  le  rearrangement  des  exprcssions/dcs  mots  du  fait  des  differences  syntaxiques  entre  langues.  souleve 
nombre  de  difficulty's  quant  a  ('integration  corrccte  des  descripteurs  typographiques  (caracteres  gras,  soulignement.  etc.)  dans 
la  langue  cible.  Si  I'on  ne  souhaite  pas  confier  ces  elements  "techniques",  qui  font  partic  de  la  traduction,  au  traducteur/reviseur. 
il  convient  d’integrer  aux  systemes  de  micro-edition  des  moyens  plus  performants  de  description  de  la  forme  du  texte.  Voila  le 
deft  que  les  systemes  de  TAO  existants  et  futurs  doivent  relever. 
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Convivialite. 

Pour  le  poste  de  travail  du  traducteur,  la  convivialite  joue  un  role  important  a  chaque  niveau  des  systemes  de  traduction.  Pas  un 
traducteur  humain  ne  souhaiterait  utiliser  un  systeme  de  traduction  et  devoir  jouer  le  role  d'esclave  qui  consisterait  a  corriger 
jour  apres  jour,  les  memes  erreurs  “grossieres"  de  traduction  du  systeme.  Le  danger  existe  a  l’heure  actuelle,  parce  que  les 
systemes  existants  ne  sont  pas  suffisamment  flexibles  et  adaptes. 

Les  developpements  futurs  des  systemes  de  traduction  doivent  done  se  concentrer  sur  les  activity  qui  donnent  a  l’utilisateur 
davantage  de  possibility  de  retroaction.  Le  “concept  du  fichier  personnel”  qui  est  parfois  employe  dans  les  systemes  de  base  de 
donnees,  oil  un  utiiisateur  peut  choisir  et  creer  une  “partie  personnelle”  dans  une  base  de  donnees,  pourrait  etre  un  exemple:  au 
moins  au  niveau  du  dictionnaire,  Putilisateur  devrait  disposer  de  fonctions  pour  realiser  —  sur  la  base  des  donnees  existantes  — 
son  ou  ses  propres  dictionnaires  (physiquement  ou  logiquement).  A  et  egard,  ausssi  bien  les  fabricants  de  systemes  de  base  de 
donnees  et  les  utilisateurs,  peuvent  tirer  profit  d’un  tel  concept;  le  vocabulaire  specialise  du  systeme  sera  ameliore  et 
Putilisateur  aura  une  influence  non  negligeable  (mais  aussi  responsable)  dans  le  choix  des  traductions. 

Ce  qui  est  vrai  pour  la  partie  lexicale,  devrait  egalement  etre  applicable  aux  elements  structurels  (par  exemple.  pour  agir  sur  la 
longueur  des  phrases,  ou  certains  composants  stylistiques,  etc...).  Les  systemes  existants  devraient  offrir  davantage  de  flexibility 
et  les  systemes  futurs  devraient  prendre  en  considerations  de  tels  elements  des  le  commencement. 

Rentabilite. 

Commercialement  parlam,  la  decision  d'utiliser  un  systeme  de  traduction  est  basee  sur  un  calcul  de  rentabilite.  Dans  ce  cas,  ce 
n'est  pas  uniquement  un  probleme  financier  pur,  parce  que  le  “temps  e'est  de  Pargent  et  qu’une  traduction  obtenue  sans  tarder 
peut  avoir  un  prix  plus  eleve. 

Mais  en  fin  de  compte,  la  decision  est  prise  sur  une  base  economique.  en  tenant  compte  des  effets  sociaux  ou  humains. 

Partant  du  fait  que  les  renseignements  precis  sont  rares  sur  le  cout  de  developpement  des  systemes  de  TAO,  pour  autant  que 
Pauteur  le  sache,  nous  devons  etudier  la  question  de  la  rentabilite  au  niveau  de  Putilisateur. 

II  semble,  a  Pheure  actuelle,  que  la  quantite  de  traductions  (mesurees  en  pages/jour)  qui  sont  produites  par  un  traducteur 
humain  par  interaction  et/ou  post-edition  peut  etre  sensiblement  augmentee.  En  supposant  que,  par  exemple,  la  maintenance 
informatique  et  Papplication  technique  d'un  systeme  comme  SYSTRAN  couteront  $300,000/  annee,  et  que  300,000  pages 
peuvent  etre  techniquement  traduites  par  le  systeme,  le  cout  de  la  traduction  pure  —  non  compris  la  mise  a  jour  des 
dictionnaires  et  la  preparation  ou  post-edition  de  la  traduction  brute  —  est  negligeable  ($  1/page).  Le  cout  du  processus 
complet  (traduction  avec  interaction  homme/machine)  differe  selon  la  qualite  souhaitee.  Pour  une  traduction  d’assez  bonne 
qualite  —  ce  qui  signifie  une  traduction  informative  par  exemple,  pour  les  documents  de  travail,  une  traduction  rapide  peut  etre 
faite,  de  sorte  qu’un  traducteur  humain  produise  environ  20  pages/jour  (au  lieu  de  4-6  pages  sans  TAO). 

Pour  atteindre  un  haut  niveau  de  qualite  comparable  aux  traductions  humaines  professionnelles.  on  doit  consacrer  davantage 
de  temps  a  la  revision  ou  post-edition  Mais  il  semble  que  le  point  d’equilibre  soil  atteint  dans  ce  sens  que  si  le  vocabulaire  d'un 
systeme  de  traduction  est  adapte  au  domaine  de  l'utilisateur,  le  cout  de  traduction  est  inferieur  au  cout  de  traduction  humaine 
pure,  meme  en  considerant  que  (’utilisation  des  systemes  de  traitement  de  textes  dans  le  processus  de  traduction  fait  gagner 
environ  20  %  du  temps. 

Problemes  linguistiques  et  strategiques  de  base. 

L’aspect  morphologique  (a  savoir  les  problemes  d’inflexion,  de  derivation,  et  de  decomposition)  des  systemes  de  traduction  ont 
ete  resolus  avec  succes,  au  moins  a  des  fins  pratiques  d'environnement  duplication,  meme  si  la  traduction  correcte  des  mots 
derives  et  decomposes  n’est  pas  toujours  atteinte  au  moyen  de  regies  automatisees.  Ce  n'est  pas  le  cas  pour  des  solutions  aux 
niveaux  syntaxique  et  semantique.  Meme  si  nous  supposons  qu’un  probleme,  par  exemple,  la  desambiguisation  des 
homographies  syntaxiques  (comme  RAINS  dans  IT  RAINS  ou  THE  RAINS)  peut  etre  resolue  grace  a  un  systeme  d’analyse 
strict  et  entierement  formalise,  la  complexity  de  la  structure  du  langage  naturel  mene  a  un  depassement  du  temps  d’ordinateur 
si  on  essaye  d’integrer  ou  de  manipuler  chaque  structure  ou  occurence  (partielle).  Ainsi,  en  realite,  les  systemes 
commercialement  disponibles  essayent  de  raccourcir  le  processus  d’identification  (ou  de  disambiguisation)  par  le  truchement 
de  regies  deterministes  particulieres  ou  de  probability.  En  consequence,  les  systemes  fonctionnent  10000  ou  1000  fois  plus 
rapidement  qu’un  systeme  entierement  base  sur  la  linguistique,  mais  leurs  res  u  I  tats  ne  peuvent  pas  atteindre  la  meme  qualite. 

Aujourd'hui,  le  temps  d’ordinateur  ne  joue  plus  le  meme  role  qu’il  y  a  quelques  annees,  mais  le  temps  d’ordinateur  de 
traduction  automatique  jusqu’a  maintenant  n’est  pas  entierement  negligeable.  Ceci  est  vrai  aussi  pour  la  solution  des  problemes 
d’homonynie,  a  savoir  dans  le  domaine  semantique.  D’une  part,  il  y  a  des  limites  dans  le  traitement  de  la  structure  des  textes  par 
opposition  a  la  structure  de  la  phrase.  Dans  la  plupart  des  cas,  la  base  de  connaissances  d’un  systeme  de  traduction  est 
1’environnement  de  la  phrase,  ce  qui  signifie  que  les  informations  ou  les  solutions  des  phrases  precedentes  ne  sont  pas  prises  en 
compte  et  que  presque  aucune  donnee  n’est  connue  au  niveau  des  textes.  Ceci  souleve  beaucoup  de  problemes,  en  particulier 
dans  le  domaine  de  la  reference  pronominale,  mais  egalement  dans  l’insertion  d’article  et  la  desambiguisation  des  homonymes. 
Dans  les  systemes  existants,  la  resolution  du  probleme  general  des  ambiguity  semantiques  passe  par  1’introduction  de  codes 
semantiques  (qui  —  sur  un  plan  general)  —  joue  egalement  un  certain  role  dans  la  dyambiguisation  des  des  structures 
syntaxiques),  particulierement  les  marqueurs  par  domaine  ou  par  discipline,  qui  sont  utilisy  pour  choisir  le  “bon”  mot  (ou 
sequence  de  mot)  selon  les  parametres  specifiques  donnees  par  l’utilisateur.  Ils  essayent  egalement  de  ryoudre  ce  probleme 


par  une  consultation  du  dictionnaire  afin  d'identifier  les  sequences  de  mots  ou  meme  les  citations  (qui  doi.ent  normalement 
etre  lexicalisees  parce  qu’il  n'existe  aucune  solution  algorithmique),  voir  1’exemple  en  allemand  de  ES  REGNET 
BINDFADEN,  qui  doit  etre  traduit  par  IT  RAINS  CATS  AND  DOGS. 

Depuis  Chomsky,  1'approche  structurale-semantique  systematique  a  1’analyse  de  la  langue,  a  la  “comprehension”  et  a  la 
traduction  a  progresse.  Ainsi,  au  niveau  de  la  recherche,  on  dispose  maintenant  de  plusieurs  types  de  grammaire  formalises  et 
de  systemes  d’analyse  disponibles.  Les  recherches  en  traduction  automatique  continuent,  que  ce  soit  au  Japon  (voir  le  MU- 
SYSTEM),  ou  en  Europe  (a  savoir  les  efforts  consacres  par  la  Communaute  europeenne  et  les  Etats  membres  pour  le  systeme 
europeen  de  traduction  EUROTRA).  Mais  le  developpement  linguistique  informatique  n’est  pas  suffisant:  les  linguistes,  les 
informaticiens,  les  documentalistes  et  les  utilisateurs  doivent  cooperer  dans  les  projets  a  grande  echelle  pour  viser  le  but  d’une 
utilisation  pratique. 

Exemples 

Pour  donner  quelques  notions  sur  letat  de  la  question  des  systemes  dits  productifs  (pour  ne  pas  dire  commercial),  et  pour 
montrer  egalement  comment  les  criteres  mentionnes  peuvent  etre  appliques,  on  decrira  deux  systemes,  les  systemes 
SYSTRAN  et  SUSY/STS. 

SYSTRAN 

SYSTRAN  (droits  commerciaux  a  Gachot  S.A.,  France)  —  dans  sa  nouvelle  version  3.7  —  a  les  caracteristiques  suivantes: 

—  Traduction  de  texte  integral.  Meme  si  la  structure  de  la  phrase  n'est  pas  correcte.  si  des  mots  sont  mal  orthographiesu  ou 
s’ils  ne  figurent  pas  dans  le  dictionnaire  du  systeme.  une  traduction  est  faite. 

—  La  vitesse  de  traduction  (selon  la  capacite  d’ordinateur)  atteint  jusqu’a  350.000  mots/heure.  C’est  done  le  systeme  le  plus 
rapide  disponible  sur  le  marche. 

—  Le  systeme  fonctionne  par  paires  de  langue.  Les  traductions  sont  effectuees  dans  les  paires  de  langues:  anglais  —  franyais, 
anglais  —  italien.  franyais  —  anglais,  russe  —  anglais  (USAF),  anglais  —  Japonais  (SYSTRAN  JAPON),  anglais  —  arabe.  Les 
paires  de  langues  actuellement  developpees  sont,  entre  autres,  anglais  —  allemand.  franyais  —  allemand,  allemand  —  anglais  et 
allemand  —  franyais.  La  qualite  depend,  d'une  part,  de  la  disponibilite  des  dictionnaires  specifiques.  Un  effort  considerable  a 
ete  consacre  a  la  Communaute  europeenne  pour  developper  les  dictionnaires  SYSTRAN.  Pour  la  traduction  anglais-franyais. 
le  dictionnaire  contient  maintenant  plus  de  1 50.000  entrees.  La  meme  qualite  n’est  pas  atteinte,  par  exemple.  pour  la  traduction 
allemand-franyais,  qui  en  est  a  sa  phase  de  demarrage,  meme  au  niveau  de  l’analyse  de  la  langue. 

—  L'utilisation  de  SYSTRAN  requiert  des  specialistes  techniques  (et  des  gestionnaires).  Ainsi  seules  des  societes  qui 
disposent  d’un  personnel  specialise  (teilcs  que  la  CEE  ou  USAF)  peuvent  utiliser  SYSTRAN  sur  leur  propre  ordinateur  (si 
I’ordinateur  est  une  unite  centrale  IBM  ou  compatible  IBM).  Mais  il  y  a  une  alternative  interessante:  utiliser  le  systeme  par 
l'intermediaire  de  reseaux  de  telecommunication,  par  exemple  reseau  de  commutation  ou  —  dans  une  optique  tres  “futuriste” 

—  par  videotex.  En  France,  une  application  videotex  (appelee  TELETEL)  en  utilisant  un  telephone  plus  un  ecran  (MINITEL) 
est  deja  operationnelle  (et  est  meme  employe  par  les  eieves). 

—  L’enrichissement  du  dictionnaire.  Jusqu'ici.  les  dictionnaires  SYSTRAN  sont  normalement  faits  par  les  specialistes  du 
systeme.  Le  probleme  principal  n'est  pas  le  codage  lui-meme  (qui  est  tres  complexe.  mais  pourrait  et  sera  resolu  grace  a  des 
interfaces  conviviales),  mais  la  coherence  de  la  base  de  donnees  du  dictionnaire.  Les  dictionnaires  contiennent  des  codes  de 
domaine.  qui  demandent  a  etre  ameliores  de  fayon  a  mieux  correspondre  aux  attentes  de  I'utilisateur. 

—  La  portability  de  SYSTRAN  est  presque  impossible  a  realiser:  SYSTRAN  est  ecrit  en  IBM-ASSEMBLEUR.  meme  si  les 
regies  linguistiques  sont  normalement  ecrites  dans  des  macros  speciales.  Le  systeme  lui-meme  fonctionne  —  comme  mentionne 
ci-dessus  sur  unite  centrale  (IBM  ou  SIEMENS  ou  AMDAHL)  ou  sur  des  ordinateurs  de  dimension  semblable.  Une  revision 
de  logiciel  (peut-etre  sur  UNIX)  est  projetee. 

—  La  qualite  de  traduction  (brute)  differe  selon  les  couples  de  langues  et  la  capacite  du  dictionnaire:  Si  on  se  concentre  sur  le 
franyais-anglais,  les  pourcentages  suivants  peuvent  s'appliquer:  Identification  morphologique:  environ  100  %;  structures 
syntaxiques:  environ  90  %;  desambiguisation  semantique:  entre  80  et  90  %,  selon  l’i.itegration  du  dictionnaire  des  LS  (limited 
semantic). 

—  SYSTRAN  peut  etre  combine  a  plusieurs  environnements  de  traitement  de  texte.  On  peut  utiliser  un  Wang  OIS  (employe 
a  la  CEE),  mais  aussi  utiliser  un  PC  (IBM  compatible)  avec  un  WordPerfect.  De  meme.  on  peut  saisir  les  textes  en  lecture 
optique  et  creer  des  fichiers-texte  avec  WordPerfect,  envoyer  les  donnees  au  serveur  SYSTRAN  (par  exemple  a  Gachot  S.A.. 
Paris)  et  recuperer  le  texte  traduit  en  differe.  Enfin,  un  outil-logiciel  permet  de  post-editer  le  texte  sur  ecran  di vise  (visualisation 
simultanee  du  texte  source  et  cible). 

L’utilisation  du  systeme  de  traduction  elle-meme  se  fait  en  traitement  par  lots.  Pendant  le  processus  de  traduction,  il  n'y  a 
aucune  possibility  d’intervention  (en  depit  du  fait  que  le  gestionnaire  du  systeme  a  la  possibility  d'introduire  des  traductions 
pour  les  mots  inconnus).  D’autre  part,  I'utilisateur  utilise  le  systeme  comme  une  boite  noire,  il  n’a  besoin  d'aucune  connaissance 
du  systeme. 
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II  n’existe  aucune  information  disponible  sur  le  cout  de  developpement  du  systeme.  On  peut  estimer  qu’il  se  situe  entre 
$20,000,000  et  $50,000,000.  Rien  que  pour  la  CEE,  le  cout  d’utilisation  est  de  l’ordre  de  $4,000,000  a  $6,000,000.  Le  cout 
d’exploitation  de  la  version  SYSTRAN  (sans  les  royalties)  avoisine  —  a  mon  avis  —  tes  $300.000/an.  Aussi  si  1’on  prend 
I’hypothese  de  300.000  pages  traduites  par  an  (par  exemple,  a  la  CEE),  le  cout  est  inferieur  a  $l/page  (a  I’exception  du 
traitement  de  texte,  de  la  post-edition  et  du  temps  d’ordinateur).  Le  cout  de  traduction  sur  MINITEL  (videotex)  est  d’environ 
$0, 1 0  par  "fenetre"  (jusqu’a  1 0  lignes)  et  la  traduction  (brute)  automatique  s’affiche  normalement  apres  une  attente  de  45  a  60 
secondes. 

Ceci  conduit  a  la  conclusion  que  SYSTRAN  est  congu  pour  1’utilisateur,  a  savoir  que  son  marche  est  cetui  de  la  "societe  de 
I’information  oil  il  n’est  pas  indispensable  que  la  qualite  atteigne  100  %,  mais  qu'un  utilisateur  qui  ne  connait  pas  ou  pas 
suffisamment  la  langue  ait  une  bonne  comprehension  du  texte  traduit. 

SUSY/STS. 

SUSY/STS,  developpe  dans  le  cadre  d'un  grand  projet  de  recherche  en  linguistique  computationnelle  a  1’universite  de 
Saarbriick  (RFA),  est  maintenant  employe  comme  systeme  de  production  dans  le  domaine  de  la  traduction  de  bases  de 
donnees.  Ce  service  est  fait  par  l'lnstitut  de  la  Recherche  de  l’lnformation  Appliquee  (1AI)  a  Saarbriick.  Les  principals 
caracteristiques  sont: 

—  La  traduction  de  textes  integraux  (au  niveau  de  la  phrase)  est  possible.  Dans  I’environnement  duplication,  on  se 
concentre  sur  la  traduction  du  titre  (et  du  resume). 

—  Comme  SYSTRAN,  SUSY/STS  est  un  gros  systeme  de  traduction  de  portee  generate:  un  texte  est  traduit  raerae  s’il  existe 
des  mots  inconnus.  Dans  un  tel  cas,  le  mot  non  connu  est  restitue  dans  le  texte  cible. 

—  Dans  une  certaine  mesure,  SUSY/STS  est  un  systeme  de  traduction  multilingue.  Seul  la  phase  de  transfert  (entre  deux 
langues)  est  bilingue,  tandis  que  les  phases  d'analyse  et  de  synthese  sont  effectuees  independamment  de  la  langue  source  et 
cible.  Les  applications  SUSY/STS  sont  possibles  pour  les  langues  Allemand  —  anglais,  anglais  —  allemand  et  russe  — 
allemand,  et  des  efforts  sont  consacres  au  developpement  du  frangais  —  allemand  et  esperanto  —  allemand.  L’application 
essentielle  est  faite  en  allemand  —  anglais,  avec  un  dictionnaire  de  base  d'analyse  de  l'allemand  d'environ  1 50,000  entrees,  un 
dictionnaire  d’expressions  d'environ  140,000  entrees,  et  un  dictionnaire  allemand  —  anglais  d'environ  80,000  entrees. 

—  Comme  SYSTRAN,  SUSY/STS  exige  un  personnel  specialise  pour  fame  fonctionner  l'application.  Par  consequent,  on  se 
concentre  sur  un  concept  de  service  de  traduction.  Les  traductions  sont  effectuees  uniquement  au  centre  de  calcul  de  1'IAl,  a 
Saarbriick.  Les  clients  doivent  envoyer  leurs  donnees  sur  bandes  magnetiques  a  1’lAI,  ils  obtiennent  en  retour  une  bande 
magnetique  comprenant  les  traductions.  Les  applications  principales  sont,  comme  mentionne,  la  traduction  en  anglais  de  titres 
allcmands  venanl  des  differentes  bases  de  donnees.  En  outre,  la  terminologie  bilingue  dans  les  differents  domaines  est  mise  a 
jour,  les  dictionnaires  sont  enrichis  avec  la  terminologie  specifique  pendant  le  traitement  des  donnees  du  client.  Les  clients, 
entre  autres,  sont:  Le  Centre  d’Information  pour  la  Construction  du  Batiment,  oil  une  base  de  donnees  bilingue  (ICONDA) 
existe;  I’Office  des  Brevets  allemand  oil  —  a  l'heure  actuelle  —  I'lndex  des  mots-cles  en  allemand  est  traduit  en  anglais;  l'lnstitut 
allemand  des  Normes,  oil  les  titres  des  normes  industrielles  allemandes  sont  traduits  en  anglais  pour  etre  integres  dans  la  base 
de  donnees  correspondante;  le  Centre  d'Information  pour  les  Sciences  Sociales,  oil  les  titres  dune  base  de  donnees 
bibliographiques  sont  traduits.  Jusqu'ici.  plus  de  100.000  titres  ont  ete  traduits  par  ce  service. 

—  Le  systeme  fonctionne  sous  UNIX  (et  est,  dans  ce  sens,  portable).  Le  langage  de  programmation  est  FORTRAN,  avec 
certaines  parties  de  base  ecrites  en  “C”.  Une  version  est  egalement  disponible  sur  l'ordinateur  central  de  SIEMENS. 

—  La  qualite  de  la  traduction  automatique  (brute)  (allemand  —  anglais)  est  satisfaisante;  environ  99  %  d'identification  des 
mots  dans  la  langue  source  au  niveau  morphologique;  environ  90  %  d’identifications  correctes  au  niveau  syntaxique.  II  n’y  a 
aucune  statistique  disponible  sur  l’exactitude  de  la  desambiguisation  semantique,  toutefois  la  mise  au  point  d'outils  permettant 
d’integrer  les  codes  de  domaines  est  en  cours. 

—  Le  service  assure  normalement  la  post-edition  pour  atteindre  un  haut  niveau  de  qualite  de  traduction.  Comme  precise  ci- 
dessus,  le  client  transmet  ses  donnees  sur  bande  magnetique  au  centre  de  calcul.  Dans  un  premier  temps,  on  precede  a  un 
controle  orthographique  pour  corriger  les  mots  mal  orthographies.  Ensuite,  les  mots  inconnus  (essentiellement  dans  le 
dictionnaire  de  transfert  du  systeme  de  traduction)  sont  identifies,  le  dictionnaire  est  alors  mis  a  jour  (par  le  traducteur 
humain).  Le  texte  est  ensuite  traduit  automatiquement  et  post-edite  (sur  ecran  a  (  aide  d’un  PC  ou  d'un  terminal)  par  le 
traducteur  humain.  Apres  une  phase  de  controle,  la  bande  magentique  est  retoumee  au  client.  II  est  possible  d’integrer  un 
systeme  d’indexage,  c’est-a-dire  de  partir  de  la  phase  d’analyse  de  SUSY/STS  pour  engendrer  des  mots  lemmatises  (au  lieu  de 
formes  de  mots)  et  decomposer  ou  deriver  des  mots  complexes  pour  obtenir  des  relations  entre  les  mots  composes  et  les  mots 
simples  desquels  elles  sont  derivees. 

—  Comme  SYSTRAN,  SUSY/STS  fonctionne  simplement  en  traitement  par  lots.  Mais  un  utilisateur  specialement  forme 
pourrait  integrer  des  mots  nouveaux  dans  les  differents  dictionnaires:  1’enrichissement  de  dictionnaire  se  fait  en  mode 
conversationnel.  La  post-edition  peut  etre  faite  separement  sur  PC  ou  par  un  terminal  sur  l’ordinateur  centra!  (Nixdorf 
TARGON  35  au  Centre  de  calcul). 

—  Le  cout  de  developpement  du  systeme  est  de  1’ordre  de  $5,000,000  (non  inclus  la  recherche  de  base  a  l’universite). 
L’adaptation  a  l’environnement  de  service  STS  se  chiffre  a  environ  $200,000.  Le  cout  de  base  de  fonctionnement  de 


1'ordinateur  est  de  $  1 50,000/an,  ce  cout  incluant  le  personnel  (pas  les  traducteurs).  Le  service  atteint  son  seuil  de  rentabilite  a 
partir  du  moment  ou  environ  400.000  titres  ou  phrases  ont  ete  traduits,  (prix  d‘un  litre:  2  DM  =  environ  $0,80). 


—  Comme  conclusion,  on  peut  dire  que  SUSY/STS  est  un  systeme  de  traduction  specialise.  Sa  vitesse  (traduction  de  5.0(H) 
mots/hcure.  temps  CPU)  n'est  pas  comparable  a  SYSTRAN,  et  il  n’y  aura  pas  d'augmentation  des  couples  de  langues. 
Toutefois,  on  peut  presumer  que,  en  prenant  en  compte  les  possibilites  et  I'adaptation  aux  besoins  specifiques,  les  systemes  de 
traduction  joueront  un  role  interessant  a  I'avenir. 

Dans  les  limites  de  cet  article,  il  n'est  pas  possible  de  donner  une  description  precise  des  autres  systemes  de  traductions 
existants  (et  disponibles).  Ceci  exige  aussi  d’etre  au  fait  de  (utilisation  des  systemes  et  d'avoir  une  bonne  connaissance  au 
niveau  conceptuel.  Le  but  de  cet  article  etait  d’exposer  certains  exemples  concrets  afin  de  donner  une  premiere  impression  de 
la  complexity  de  ce  theme.  Ainsi  il  peut  paraitre  aise  d'utiliser  un  systeme  de  traduction  du  point  de  vue  "linguistique"  pur  (dans 
le  sens  que  la  qualite  de  traduction  machine  n’est  pas  comparable  a  la  qualite  d'une  traduction  humaine)  ou  —  d’autre  part, 
d’utiliser  un  systeme  de  traduction  comme  un  outil-logiciel  technique  pur.  Des  progres  ne  seront  accomplis  que  dans  la  mesure 
ou  l’on  tient  compte  des  limites  et  des  possibilites  des  systemes  de  traduction  dans  des  environnements  d'application  concrets. 

5.  Perspectives. 

Si  Ton  se  toume  vers  un  avenir  lointain,  il  semble  assez  ciair  que  la  traduction  professionnelie  pascera  par  des  systemes  de 
traduction.  Ceci  dependra  plus  ou  moins  de  la  disponibilite  des  "bons”  couples  de  langues,  de  I’etat  complet  des  dictionnaires, 
de  la  mise  en  place  du  ’’bon’’  environnement  technique  et,  argument  non  des  moindres,  du  cout  d'utilisation  du  systeme. 

En  attendant,  les  systemes  de  traduction,  en  tant  que  systemes  d’aide  terminologique  a  la  traduction  —  integres  ou  ajoutes  aux 
traitements  de  texte,  joueront  un  role  important,  particulierement  comme  aide  a  la  creation  d'un  texte  dans  une  langue 
etrangere  par  le  redacteur  ou  comme  aide  a  la  comprehension  d’un  texte  lisible  par  ordinateur  par  le  destinataire  par 
telecommunication  (Teletex).  Dans  1’environnement  bureautique.  les  systemes  de  traduction  assistee  par  ordinateur, 
principalement  grace  aux  dictionnaires  bilingues  sur  CD-ROM  ou  sur  disque  dur  seront  utilises  comme  une  variante  d'aide  de 
style  et  de  grammaire. 

Contrairement  a  certain  s  hypotheses  (voir  Hutchins  1986,  p.33 1-334),  nous  ne  voyons  pas  la  necessite  d'un  poste  de  travail 
traducteur.  mais  plutot  certaines  fonctions  specialisees  telles  que  I’aide  a  la  traduction  integree  dans  un  poste  de  travail  du 
redacteur,  de  sorte  que  chacun  tire  profit  de  la  mise  au  point  de  dictionnaires  et  de  thesaurus  automatises. 

Quel  sera  le  role  du  traducteur  humain  dans  1’environnement  de  la  traduction  technique  ?  On  peut  voir  un  systeme  de 
traduction  comme  un  systeme  specialise  qui  toutefois,  n’atteindra  sans  doute  jamais  —  d’une  maniere  generale  —  le  haut  niveau 
de  qualite  des  traducteurs  specialises.  Mais  pour  mettre  au  point,  ameliorer  et  meme  maintenir  de  tels  systemes  de  traduction, 
on  a  besoin  d’ingenieurs  specialises  en  langues  capables  de  faire  fonctionner  et  d’enrichir  ces  systemes.  La  complexity  du 
langage  naturel  et  les  connaissances  requises  exigent  une  cohesion  entre  le  systeme  et  le  traducteur.  Et  il  y  a  fort  a  penser  que  si 
la  traduction  de  textes  devient  meilleur  marche  et  plus  rapide,  la  demande  augmentera,  de  sorte  que,  en  fin  de  compte.  les 
traducteurs  continueront  a  jouer  un  role  substantiel  —  dans  des  conditions  differentes. 
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ABSTRACT 

This  paper  describes  the  advances  which  are  being  made  in  information  retrieval  systems 
to  assist  end-users  and  information  specialists  to  overcome  the  critical  barriers  which  make 
it  difficult  for  them  to  exploit  the  power  of  these  systems.  Technology  which  is  being  applied 
to  allow  users  to  interact  with  information  retrieval  systens  with  greater  ease  and  more  success¬ 
ful  results  is  identified.  To  illustrate  this  process,  the  efforts  of  the  U.S.  Defense  Technical 
Information  Center  to  develop  and  implement  an  integrated,  functional  scientific  and  technical 
information  network  are  described.  This  network  was  purposefully  designed  to  incorporate  both 
end-users  and  their  information  management  intermediaries  in  a  complementary  manner  making  them 
resolute  partners  in  the  work  and  its  rewards.  The  uses  of  technology  modules  —  artificial 
intelligence,  expert  systems,  gateways,  user-friendly  interfaces  —  to  overcame  user  barriers 
are  described  in  the  paper. 

introduction 


The  media  hails  the  arrival  of  the  information  age,  and  we  are  told  that  information  is  the  power  of 
the  future  and  that  we  are  in  the  midst  of  an  Information  Revolution.  From  the  vantage  point  of  the  in¬ 
formation  industry,  the  view  is  much  mare  evolutionary  in  nature.  Information  has  always  equated  to 
power,  and  we  are  exposed  cn  a  daily  basis  to  the  gap  between  media-induced  expectations  and  reality. 

The  fact  that  information  is  a  growth  industry  is  indisputable.  The  number  of  online  databases  has 
increased  from  301  in  1976  (Ref.  1)  to  more  than  2800  in  1986  (Ref.  2).  In  the  1970s,  several  online 
services  supported  these  databases.  Currently,  there  are  over  400  online  services  (Ref.  3).  Advances  in 
data  storage  and  retrieved  capabilities  have  made  it  possible  to  store  vast  volumes  of  information  online 
and  to  retrieve  that  information  in  seconds. 

The  proliferation  of  desktop  computers  has  led  to  the  vision  of  a  "tactical  desktop."  Through  the 
tactical  desktop,  all  the  information  the  desk  occupant  requires  would  be  available  by  hitting  a  key, 
squeezing  a  mouse,  pointing  a  finger,  or  issuing  a  voice  oarmand.  This,  in  turn,  has  led  to  speculation 
that  information  retrieval  experts  soon  will  be  without  purpose,  technologically  obsolete.  Examination  of 
the  facts  reveals  quite  the  opposite  is  true.  New  technology  will  strengthen  the  partnership  between  end- 
users  and  intermediaries.  Through  a  combination  of  technology,  imagination,  and  hard  work  we  will  step 
forward  towards  the  ultimate  information  system. 

BACKGROUND 


The  information  retrieval  systems  environment  consists  of  end-users,  information  specialists, 
database  producers,  and  information  systems  vendors,  using  personal  computers  often  linked  into  office 
automation  systems.  These  information  specialists,  end-users,  producers  and  vendors  have  been  working 
together  for  years.  Recent  technological  advances  in  the  information  systems  field,  the  prevalence  of 
personal  computers  in  the  office,  and  the  introduction  of  office  automation  systems  are  changing  the  roles 
of,  and  the  interaction  among,  the  players.  The  culture  of  the  information  retrieval  market  is  evolving, 
but  the  goal  of  the  culture  remains  the  same  —  provide  needed  information  in  the  most  efficient  and 
effective  manner. 

End-users  are  the  individuals  who  need  information.  End-users  run  the  gamut  from  student  to  chief 
executive  officer,  manager  to  technical  expert.  Their  reasons  for  wanting  information  are  as  varied  as 
the  type  of  information  they  require  and  the  format  in  which  they  find  it  suitable.  Some  end-users  want 
a  quick  but  accurate  answer  or  fact  while  others  merely  want  clues  to  aid  them  in  their  own  discovery 
process. 

Information  specialists  are  intermediaries.  Their  expertise  lies  in  identifying,  retrieving,  and 
delivering  needed  information  to  their  client  population  —  the  end-users.  Information  specialists  have 
many  tools  at  their  disposal,  the  most  powerful  of  which  are  the  online  retrieval  systems.  Through  these 
systems,  they  can  manipulate  massive  amounts  of  data,  sifting  through  the  volume  and  mining  the  relevant 
information  with  deft  precision. 

Database  producers  oollect  information  and  are  responsible  for  the  content  of  a  database.  Database 
producers  may  collect  information  of  a  certain  type,  such  as  all  technical  reports  produced  by  the  United 
States  Department  of  Defense,  or  may  oollect  information  in  a  certain  subject  area,  such  as  geology.  Some 
database  producers  perform  indexing  and  abstracting  services  while  others  provide  full-text  information. 
Database  producers  make  their  information  available  through  information  system  vendors.  In  some  cases, 
the  producer  and  the  vendor  may  be  one  and  the  same. 

Information  system  vendors  provide  the  software  and  hardware  to  allow  users  to  access  databases 
online.  Vendors  control  the  quality  and  capabilities  of  the  options  available  foe  retrieving  the  informa¬ 
tion.  Database  producers  also  control,  to  a  certain  extent,  the  search  options  which  can  be  exercised  in 
conjunction  with  their  files.  For  exanple,  a  title  search  option  is  useless  against  a  file  which  contains 
no  titles. 


Several  years  ago,  the  natural  order  of  the  information  field  was  that  database  producers  and  vendors 
marketed  their  services  and  products  primarily  to  the  information  intermediary,  who  was  familiar  with  the 
content  of  the  database  and  the  retrieval  mechanics  provided  by  the  system  vendors.  The  information 
intermediary,  in  turn,  marketed  retrieval  from  information  systems  as  one  of  the  services  available  to 
end-users. 

The  market  at  that  time  was  characterized  by: 

u  homogeneous  system  user  characteristics.  Users  were  information  specialists  who  would  devote  the 
time  to  learn  how  to  use  online  systems  proficiently.  Expertise  in  this  area  was  a  professional 
prerequisite. 

o  Complex  systems.  An  online  session  required  many  commands,  from  loqon  to  retrieval  to  printing. 

o  Unforgiving  systems.  Omission  of  a  carriage  return  could  be  a  fatal  error. 

o  Extensive  training  requirements.  There  were  multi-volume  instruction  manuals,  multi-day 
introductory  courses,  and  numerous  advanced  courses. 

o  Lack  of  standardization.  If  all  systems  are  complex,  a  user  who  has  mastered  one  will  easily 
beccme  "brand- loyal." 

o  Rapidly  changing  state-of-the-art.  Information  professionals  struggled  to  stay  abreast  of  the  new 
databases  and  new  information  systems  continually  being  introduced. 

o  A  unique  professional,  the  information  specialist.  Terminals  and  microcomputers  were  essential 
tools  for  their  profession. 

The  essence  of  the  market  philosophy  was  that  the  information  was  available  for  those  who  had  the 
time  to  dedicate  or  the  need  to  develop  the  skills  required  to  unlock  the  information  banks.  At  the  time, 
that  was  a  sound  philosophy. 

Revolution  Turned  Evolution 

The  introduction  of  personal  computers  into  the  office  (end-user)  environment,  combined  with  the 
developing  awareness  in  end-users  that  the  information  available  online  could  provide  a  competitive  edge, 
played  a  major  role  in  alterinq  the  above-stated  philosophy.  (The  perception  that  the  personal  computer 
would  become  as  commonplace  as  the  telephone  in  the  office  and  the  home  environment  was  probably  more 
important  than  the  reality. ) 

It  did  not  take  long  for  database  producers  and  vendors  to  have  visions  of  end-users  armed  with 
personal  computers  performing  their  own  searching.  Think  how  many  hundreds  of  end-users  exist  for  each 
information  specialist.  And,  we  all  know  that  the  amount  of  information  end-users  ask  for  is  a  function 
of  the  distance  from  their  desk  to  the  location  at  which  the  question  must  be  posed.  Reducing  the  dis¬ 
tance  between  chair  and  answer  to  zero  could  have  a  tremendous  effect  on  the  end-users  information  needs. 

The  idea  of  doing  their  own  searching  appealed  to  many  end-users.  They  were  aware  of  the  quantity 
of  information  available  online  to  help  them  meet  their  goals.  They  could  have  all  the  information  they 
ever  wanted  at  their  fingertips.  The  Information  Age  had  arrived. 

End-users,  database  producers,  and  vendors  rushed  to  embrace  each  other.  "The  End  Users  are  Ccming!" 
became  a  familiar  cry  throughout  the  information  oamunity.  Information  intermediaries  were  told  their 
jobs  were  at  risk  as  end-users  learned  to  use  remote  terminals  to  unlock  information  sources. 

Yet,  for  all  of  this  fanfare,  the  predicted  multitudes  of  end-users  have  not  arrived  yet,  and  direct 
user  accessing  of  online  information  remains  an  almost  arcane  art.  The  encounter  between  the  end-users, 
in  their  new  role,  and  the  information  world  yielded  an  important  lesson: 

Lesson  One:  As  a  general  rule,  end-users,  unlike  information  intermediaries,  will  not  devote 

extensive  time  learning  to  use  a  system. 

End-users  found  that  walking  to  the  library  was  a  lot  faster  than  learning  to  use  the  retrieval  systems. 

Vendors  then  set  out  on  a  quest  to  find  the  perfect  user  interface. .  To  attract  end-users,  the 
vendors  would  have  to  focus  on  making  these  interfaces  "user-seductive."  Several  systems  deemed  to  be 
"user  friendly"  were  put  on  the  market.  More  lessons  were  learned  from  this  experience: 

Lesson  Two:  Lesson  One  is  the  only  general  rule  that  applies  to  end-users  across-the-board. 

Lesson  Three:  End-users  are  a  diverse  grotp  with  many  diverse  needs.  They  require  different  types 

of  data,  different  products  and  services,  different  user  interfaces,  and  different 
marketing  strategies. 

Lesson  Four:  User-friendliness  is  an  attribute  which  is  bestowed  by  the  user  and  not  an  innate 

state  of  being  which  is  created  throuqh  system  design. 

The  interfaces  that  failed  early  suffered  from  the  fact  that  the  market  was  in  its  infancy  and  that 
end-users  were  viewed  as  a  single  market.  As  a  result  of  these  early  painful  experiences,  the  vendors 
began  to  segment  the  end-user  market.  Products,  services,  and  market  strategies  were  tailored  to 
individual  segments.  Ease  of  use  and  variety  of  information  were  the  key  ingredients  in  the  strategy  for 
success  and  the  foundation  of  a  new  market  philosphy.  Therefore,  database  vendors  began  investing  in  the 
development  and  application  of  new  technology. 
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Vftiich  brings  us  to  another  important  lesson: 

Lesson  Five:  End-Users,  even  in  the  abstract,  have  considerable  market  leverage  that  benefits  the 
entire  information  retrieval  field. 

The  information  systems  that  sure  evolving  directly  benefit  information  specialists  as  well  as  end-users. 
The  continued  evolution  of  information  systems  and  accrual  of  benefits  to  the  oamnunity  will  require  a 
commitment  from  sill  involved  parties. 

The  Challenge 

As  noted  earlier,  the  nutber  of  databases  in  existence  and  accessible  is  increasing  dramatically. 

This  trend  will  continue,  and  we  will  see  the  addition  of  more  "types*  of  databases.  Nuneric  and  factual 
databases  will  begin  to  number  with  bibliographic  databases  and  eventually  surpass  them  as  more  database 
vendors  move  toward  providing  full-text  online. 

The  increase  in  the  nunber  of  online  databases  means  that  there  is  more  information,  and  more  types 
of  information,  readily  available.  As  more  information  becomes  available  online,  the  probability  of 
locating  needed  information  also  increases  dramatically.  Consider,  for  exanple,  the  ease  with  which 
details  frcm  a  newspaper  article  can  be  located  when  full-text  articles  cure  available  online.  And,  when 
information  cannot  be  located  online,  we  can  conclude  with  greater  certainty  that  the  subject  has  not 
yet  been  covered  in  the  literature. 

The  challenge  which  arises  frcm  having  so  much  information  available  is  how  to  provide  selectivity. 

By  what  measures  do  we  ensure  that  the  proper  sources  are  utilized  and  that  relevant  information  is 
retrieved?  How  do  we  avoid  leaving  out  important  sources  of  information  and  retrieving  volumes  of 
irrelevant  data?  Some  of  the  difficulties  associated  with  this  task  are: 

o  maintaining  a  constant  alert  for  new  databases/services 
o  registering  for,  and  maintaining  accounts  with,  each  of  the  services 
o  selecting  the  right  service (s)  for  the  query 

o  learning  the  diverse  command  structures  associated  with  each  of  the  services 
o  becoming  knowledgeable  regarding  the  terminology  used  in  each  database 
o  merging  and  analyzing  results  from  multiple  services 

Many  of  these  problems  are  being  resolved  through  the  application  of  new  information  technologies. 

Technology  to  Meet  the  Challenge 

Recent  advances  in  information  retrieval  systems  toward  easier  and  more  successful  use  have  focused 
on  developing  and  applying  interface  technology.  These  advances  range  from  the  development  of  a  simple, 
interactive  menu  for  searching  a  single  system  to  the  incorporation  of  gateway  and  artificial  intelligence 
technology  to  aid  the  user  in  navigating  through  numerous,  diverse  information  systems  and  culling  out 
relevant  information.  The  end  goal  is  to  allow  users  to  interact  with  these  systems  via  an  expansive 
natural  language  interface,  but  the  technology  required  for  this  effort  is  not  mature  enough  to  meet  the 
task.  In  the  interim,  technology  is  in  place  that  provides  users  with  options  for  executing  information 
searches.  Seme  of  the  major  alternatives  are  described  below. 

Interactive  Menu.  The  objective  of  interactive  menu  development  is  to  design  the  system  so  straight- 
forward  that  a  novice  user  can  initiate  the  basic  operations  required  to  perform  a  search.  The  menus 
remove  the  requirement  for  the  user  to  learn  obscure  retrieval  protocols.  Often  the  design  incorporates 
an  option  of  using  a  limited  number  of  natural  language  commands  or  command  abbreviations  as  the  user 
becomes  more  familiar  with  the  system.  These  interfaces  may  be  built  into  the  information  system  itself 
or  executed  from  a  remote  centred  processor  or  microcomputer  system. 

Common  Command  Language  (CCL).  The  CCL  alternative  requires  that  the  user  know,  or  is  willing  to 
learn,  at  least  one  set  of  database  retrieval  commands.  The  CCL  may  be  based  on  a  national  or  interna¬ 
tional  standard  for  database  retrieval  commands,  or  the  CCL  may  be  a  system's  existing  commands  which  are 
translated,  transparently  to  the  user,  to  other  systems 'languages  so  that  other  systems  respond  to  the 
same  commands.  Systems  are  being  developed  which  provide  the  user  with  the  choice  of  using  a  standard  or 
an  existing  command  structure.  The  CCL  interface  may  be  built  into  the  information  system  itself  or 
executed  frcm  a  remote  central  processor  or  microcomputer. 

Gateways.  Gateways  allcw  users  to  interconnect  with  multiple,  diverse  information  systems.  In 
essence,  gateways  act  as  intelligent  switches.  Hie  level  of  intelligence  is  based  on  system  design.  At 
the  lowest  level  of  functionality,  gateways  handle  the  computer  protocol  rand  handshaking  so  that  systems 
can  talk  to  one  another.  Basic  capabilities  beyond  simple  interconnection  include  a  design  that  relieves 
the  user  of  the  need  to  register  for  multiple  systems,  to  learn  telecommunication  paths  and  protocols,  and 
to  memorize  system  logon  and  logout  procedures.  More  advanced  gateway  systems  incorporate  menus,  common 
command  language,  data  analysis  routines,  artificial  intelligence,  and  related  technology. 

Gateways  can  be  run  in  environments  ranging  from  single-user  microcomputers  to  multi-user,  central 
processing  mainframes.  The  number  of  resources  targeted  for  interconnection,  the  number  of  users,  and  the 
"intelligence"  required  in  the  gateway  determine  the  configuration  requirements. 

Knowledge— Based  Interface.  Menus,  common  command  languages,  and  gateways  bypass  significant  barriers 
to  information  retrieval,  but  they  have  not  broken  through  the  human  language  barrier.  Through  the 
development  of  knowledge-based  interfaces,  we  are  inching  towards  the  goal  of  natural  language  search 
interfaces.  Knowledge-based  systems  use  artificial  intelligence  (AI)  to  make  the  hunan-machine  interfaces 
more  human-like  and  more  natural.  The  system  performs  in  a  manner  that  simulates  intelligence;  it  makes 
educated  decisions.  To  get  a  system  to  this  point,  we  must  build  a  knowledge  base  similar  to  that  of  a 
hunan  expert.  This  requires  analyzing  the  knowledge  that  the  hunan  has  and  the  thought  processes  used  to 


make  decisions.  AI  programming  tools  can  then  be  used  to  program  this  knowledge  into  the  system.  The 
result  is  often  called  an  "expert  system." 

Building  a  knowledge-based  system  is  a  complex  task.  The  system  is  usually  limited  to  a  narrow 
subject  scope.  To  be  successful,  the  system  must  be  programmed  with  a  thorough  knowledge  of  the  subject 
area  and  the  natural  language  used  by  human  experts  and  their  clients.  Designing  an  interactive  natural 
language  interface  is  a  difficult  task.  Most  of  the  knowledge-based  information  system  interfaces  de¬ 
signed  thus  far  relate  to  medical  information.  A  cancer  research  file  is  an  example.  The  medical  field 
has  a  fairly  standard  vocabulary  which  makes  it  a  good  candidate  for  an  expert  system.  An  expert  system 
with  the  knowledge  of  2  li'varicr.  or  infonr.'-tiou  at«jialisf  1.  not  iiow  possible,  ana  a  development  of  such 
a  system  is  a  long-term  effort  at  best.  But  overcoming  the  communication  barrier  a  function  at  a  time 
through  use  of  the  appropriate  technology,  whether  menus  or  gateways  or  artificial  intelligence,  is  essen¬ 
tial  to  the  continued  evolution  of  information  systems.  It  becomes  even  more  critical  as  the  volume  of 
data  we  have  to  deal  with  increases. 

The  table  below  provides  a  sampling  of  interface  software  on  the  market  today  (Ref.  4).  This  list  is 
by  no  means  exhaustive.  It  notably  omits  noo-comnercial  products  which  have  been  developed  by  the  govern¬ 
ment  and  university  oenmunities.  However,  it  does  provide  an  indication  of  the  interest  and  activity  in 
the  interface  field. 


Table  1 

Representative  Interface  Systems 

Software  System 

Function 

User  Level 

BCN' s  Super  Scout 

System  access  to  15  services  including  DIALOC 

Western  Union' s  Easy  Link  (electronic  mail) 

Novice 

EasyNet 

System  access  to  more  than  10  services  including 

DIALOG,  SDC,  BRS  and  NLM 

Novice 

IN- SEARCH 

PRO-SEARCH 

System  access  to  DIALOG  and  BRS  with  cross- system 
emulation  and  general  telecomminications  capability 

Novice 

Expert 

INSTANTCOM 

— 

Upload/Download.  General  telecomrrunications  to  DIALOG, 
BRS,  Dow  Jones,  and  other  systems 

Intermediate 

MCI  Mail 

Electronic  mail  and  system  access  to  Dow  Jones  and 
other  systems 

Novice 

PC/Net-Link 

Net  Search 

System  access  to  DIALOG,  SDC,  NLM,  Dow  Jones,  Lexis, 
OCLC,  RLIN,  West law.  Questel  and  Newsnet  with  database 
manager  (Net  Search) 

Novice 

PC-Talk 

Upload/ Down load.  General  telecommunications  capabili¬ 
ties 

Intermediate 

Scihfate 

System  access  to  ISI,  DIALOG,  BRS  and  NIM  with  optional 
database  manager 

Novice/ Intermediate 

Search  Helper 

System  access  to  six  Information  Access  Co.  databases 
or.  DIALOG 

Novice 

Searchmaster 

System  access  to  SDC,  BRS,  DIALOG  and  NIM. 

Expert 

CASE  STUDY 


The  following  case  study  describes  the  efforts  of  the  U.S.  Defense  Technical  Information  Center 
(OTIC)  to  develop  and  inplement  an  integrated,  functional  scientific  and  technical  information  network. 
This  network  was  purposefully  designed  to  incorporate  both  end-users  and  their  information  management 
intermediaries  in  a  complementary  manner  making  that  resolute  partners  in  the  exploration  of  how  new 
technology  can  be  applied  in  an  actual  information  service  environment. 

In  1980,  the  U.S.  OTIC  was  assigned  functional  responsibility  to  develop  and  implement  a  coordinated 
Scientific  and  Technical  Information  Network  (STINET).  The  purpose  of  the  network  was  to  facilitate  DcO 
access  to  scientific  and  technical  information  relevant  to  DcD  mission  areas.  Key  elements  of  the  STINET 
were  identified  as:  DcO  managers  and  scientists;  DoD  Libraries  and  Information  Analysis  Centers  (IACs); 
DcO  Laboratories;  OTIC;  and  other  DoD  federal,  cortmercial  and  foreign  databases,  systems  and  sources  of 
STI.  OTIC  was  tasked  to  provide  a  focus  for  molding  the  STINET,  establish  interoperability  anong  the  var¬ 
ious  network  components,  and  coordinate  network  evolution.  OTIC  was  to  achieve  this  vision  by  promoting 
resource  sharing  and  cooperative  efforts  and  through  investigation,  experimentation,  and  application  of 
advanced  information  science  and  technology. 

Development  of  a  STINET  was  a  huge  and  challenging  task  which  had  to  be  accomplished  within  existing 
financial  and  personnel  constraints.  In  order  to  make  visible  progress  towards  the  STINET,  OTIC  had  to 
carefully  identify  network  requirements  and  evaluate  the  currently  available  technology  in  which  applied 
research  could  be  invested  in  both  the  short  term  and  the  long  term  to  meet  these  requirements.  Next, 

OTIC  had  to  settle  on  an  agenda  of  purposeful  steps  that  would  close  in  on  the  ultimate  goal.  Successful 
development  and  implementation  of  the  network  would  depend  on  devising  a  "doable"  plan  of  work  with  room 
for  deviation  when  opportunities  for  technological  acceleration  became  apparent. 


The  following  elements  would  be  required  for  a  successful  network: 


o  interoperability  and  interconnection 

o  tools  sued:  as  pointers  and  menus,  to  help  locate  information 
o  standardized  command  functions 

o  compatible,  multifunctional,  flexible  software  for  installation  at  network  nodes 
o  interconnection  with  diverse  sources  of  information  including  government,  commercial  and  foreign 
~  interconnection  with  diverse  types  of  information  including  nuneric  and  factual, 
o  selectivity  and  data  analysis  routines 
o  improved  delivery  systems 
o  integration  of  databases  and  people  bases 

The  technology  needed  to  meet  the  requirements  of  the  STINET  was  at  various  stages  of  readiness. 
OTIC  selected  severed,  key  and  promising  areas  in  which  to  invest  development  efforts.  The  goal  of  these 
development  efforts  was  to  provide  the  basic  foundation  for,  and  functional  capabilities  of,  the  STINET. 
These  efforts  are  sunrnarized  in  Table  2  below. 


Table  2 

Scientific  end  Technical  Information  Network  (STINET) 

Initial  Development  Efforts 

Development 

J  Network  Eunction/Description  j 
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Gateway 

Prototype:  DGIS 

Provides  the  backbone  of  the  network.  Allows  interoperability 
and  interconnection.  Interface  incorporates  menu  and  English- 
language  commands.  Accomplishes  automated  connections  to 
diverse,  geographically  distributed  resources,  simultaneous 
searching,  data  uploading /down loading,  and  file  transfer.  UNIX 
operating  system. 

0 

1 

1 

0 

1 

1 

Local  Automation 
Model  (LAM) 
Prototype:  IBIS 

Provides  the  capability  to  create  and  maintain  a  local  database/ 
catalog  in  a  network- compatible  manner.  Unifies  methodology  for 
using  internal  and  external  resources.  Provides  information 
control  mechanisms  such  as  a  circulation  system.  Prototype 
implemented  using  an  integrated  library  system  merged  with  an 
intelligent  gateway. 

1 

1 

1 

1 

1 

0 

Directory  of 
Resources 

Prototype:  Dir. 
of  Databases 

Provides  pointers  and  menus  to  help  locate  information.  A  proto¬ 
type  Directory  of  Databases  is  in  operation,  implemented  using 
the  INGRES  Database  Management  System.  A  Directory  of  Experts 
and  a  Directory  of  Computing  Resources  are  planned. 

1 

B 

1 

1 

0 

- 1 

1 

Common  Command 
Language 

Provides  a  standardized  set  of  commands  for  searching  diverse 
databases.  Implemented  in  C- Language  for  a  limited  set  of  data¬ 
bases.  Switching  to  PROI£G  for  a  production  version. 

0 

B 

B 

0 

fl 

B 

Post-Processing 

Provides  a  tool  box  of  utilities  for  selecting  and  analyzing 
relevant  data.  Dependent  on  translating  data  to  a  common  format 
for  manipulation.  Implemented  in  C- Language  and  UNIX  scripts. 

0 

B 

B 

0 

0 

End-User  Interface 
Prototype:  Search- 
MAESTFO 

Provides  a  mechanism  for  users  who  do  not  possess  retrieval 
expertise  to  search  diverse  databases.  Inplemented  using  the 
EasyNet  service  from  Telebase  Systems,  Inc. 

0 

B 

B 

0 

B 

B 

Expert  Connector 
Intermediary/ 

Expert  Link 

Provides  a  mechanism  for  establishing  people  bases  or  an  expert 
network .  Information  experts  can  be  linked  with  end-users. 
Implemented  using  the  electronic  mail  system  developed  by 

Lawrence  Livermore  National  Laboratory,  the  UNIX  Link  command, 
and  the  SOS  capability  of  Telebase  Systems,  Inc. 

1 

1 

1 

1 

1 

1 

Electronic  Document 
System 

Provides  mechanisms  for  digital  input,  storage,  and  delivery  of 
full  text.  Will  improve  selective  delivery  of  information  to 
users.  Concept  development  stage. 

■ 

0 

0 

0 

■ 

■ 

The  Dot)  Gateway  Information  System,  the  Integrated  Bibliographic  Information  System  and  the 
SearchMAESTRO  system  (described  below)  provide  examples  of  how  this  technology  is  being  developed,  applied 
and  integrated  to  provide  easier  and  more  successful  information  retrieval. 

The  DoD  Gateway  Information  System  (DGIS) 

In  1983,  OTIC  initiated  development  of  the  DoD  Gateway  Information  System  (DGIS),  an  intelligent 
gateway  system  which  would  provide  distributed  networking,  electronic  carmunication,  and  information 
access  and  analysis.  The  DGIS  was  to  link  people,  information  services,  and  computers  pertinent  to  the 
STINET.  The  technology  embodied  in  the  DGIS  would  provide  the  key  menus,  pointers,  interoperability,  and 
interconnection  within  the  STINET.  To  accomplish  this,  DGIS  would  have  to  function  as  an  electronic 
switch,  a  translator,  a  communications  interface  and  a  transaction  controller.  DGIS  would  require  a 
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variety  of  alternatives,  tailored  to  different  user  types  and  needs,  for  obtaining  and  distributing 
information. 

CGIS  focuses  on  streamlining  the  information  retrieval  and  analysis  process.  This  is  accomplished  by 
placing  the  user  at  the  center  of  a  vast  information  universe  consisting  of  people  bases  and  databases  and 
providing  the  user  with  the  navigational  tools  required  to  pinpoint  and  arrive  at  his  destination.  In 
terms  of  databases,  the  CGIS  is  designed  to  provide  users  with  answers  to  the  questions: 

WHAT  RELEVANT  DATABASES  EXIST? 

HOW  DO  I  ACCESS  THEM? 

HOW  do  I  PETRI  EVE  INFORMATION  FRCf'  Hi-.? 

HOW  DO  I  MANIPULATE  THE  RETRIEVED  INFORMATION? 

The  DGIS  provides  a  single,  easy-to-use  interface  for  identifying,  accessing,  interrogating,  and  post¬ 
processing  information  frcm  nurerous  databases  relevant  to  DoD  information  needs. 

In  terms  of  people  bases,  the  DGIS  is  designed  to  answer  the  questions: 

WHAT  EXPERTISE  IS  AVAILABLE  ON  THE  NETWORK? 

HOW  DO  I  COMMUNICATE  WITH  EXPERTS? 

HOW  DO  I  SHARE  INFORMATION  WITH  COLLEAGUES? 

The  DGIS  acts  as  an  integrated  information  system  which  allows  human  experts,  information  users,  and 
information  resources  to  exist  and  interact  in  harmony. 

Development  of  DGIS  is  a  multi-year,  multi-task  project.  A  prototype  system  has  been  developed.  The 
basic  components  of  the  system  are: 

A  DIRECTOR?  OF  RESOURCES,  SUBJECT  SEARCHABLE 

A  COMMON  METHOD  FOR  ACCESSING  AND  SEARCHING  DIVERSE  DATABASES 

TOOLS  EOR  DOWNLOADING  AND  POST-PROCESSING  DATA 

TOOLS  FOR  OOMUNICATING  WITH  A  NETWORK  OF  EXPERTS  AND  COLLEAGUES. 

DGIS  was  designed  for  a  bob  user  caronunity  including  both  intermediaries  and  end-users.  Databases 
accessed  are  federal,  commercial  and  international .  in  addition  to  large,  well-known  databases  and 
systems,  many  small,  specialized  DoD  databases  will  eventually  be  part  of  the  DGIS. 

Once  the  broad  requirements  for  DGIS  were  identified,  a  software  survey  was  conducted  to  determine  if 
a  software  product  already  existed  which  would  meet  DGIS  needs.  The  survey  showed  that  THE  system  was  NOT 
out  there,  waiting.  One  software  package,  the  Technology  Information  System  (TIS),  did  provide  a  suita'  ' ? 
foundation  on  which  to  begin  constructing  DGIS  (Ref.  5).  TIS  was  under  development  at  Lawrence  Livermore 
National  Laboratory  (LLNL)  under  the  sponsorship  of  the  Department  of  Energy  (DOE)  (Ref.  6).  TIS 
functioned  as  an  intelligent  gateway  capable  of  interconnecting  heterogeneous  information  resources  at 
geographically  distributed  locations  in  an  automated,  unified,  and  controlled  manner.  In  addition,  TIS 
downloading  and  post-processing  capabilities  were  already  available  for  selected  databases.  Therefore, 

TIS  was  used  as  the  baseline  for  developing  the  DGIS. 

Directory  of  Resources 


The  Directory  of  Resources  will  include  subdirectories  with  references  to  databases,  people,  and 
computing  resources.  In  the  first  phase,  the  Directory  of  Databases  is  being  developed;  this  is  described 
belcw.  The  Directory  of  People  will  contain  references  to  experts  in  subject  areas  and  information  re¬ 
trieval  techniques  who  may  be  contacted  via  the  network.  The  Directory  of  Computing  Resources  will  con¬ 
tain  references  to  computing  resources,  such  as  super ocrnpu ter s  which  can  be  used  for  data  analysis  and 
modeling,  available  through  the  network. 

In  order  to  develop  a  Directory  of  Databases,  it  was  necessary  to  identify  and  catalog  existing 
databases  and  make  that  information  subject-searchable,  so  that  user  information  needs  can  be  matched  to 
relevant  resources  (Ref.  7).  Although  there  are  many  directories  that  identify  commercial  and  prominent 
federal  databases,  such  information  was  not  readily  available  for  DoD  databases.  To  fill  this  void,  a 
questionnaire  was  addressed  to  the  DoD  Research  and  Development  (R&D)  community  to  identify  existing 
databases,  their  scope,  and  availability.  Over  400  databases  were  identified  (Ref.  8). 

The  next  step  was  to  build  a  database  of  the  DoD  and  DoD-re levant  databases  which  h*3  been 
identified.  A  user  survey  was  conducted  to  determine  database  requirements  (Ref  9).  A  database  schema 
was  developed  for  the  Directory/  and  database  entries  were  subject  indexed.  The  database  was  built  using 
the  INGRES  relational  database  management  system. 

The  result  of  this  effort  is  an  online  Directory  of  Databases  which  contains  information  on  the 
content,  scope,  and  availability  of  selected  databases.  The  Directory  is  subject-searchable;  upon 
entering  a  topic  of  interest,  the  user  is  provided  with  a  list  of  appropriate  databases. 

Interfaces  for  Searching  Diverse  Databases 

One  of  the  primary  goals  of  DGIS  is  to  relieve  the  user  of  the  need  to  learn  and  master  separate 
oommands  and  protocols  for  each  database  accessed.  As  mentioned  earlier,  the  DGIS  target  user  community 
includes  both  end-users  and  intermediaries.  It  is  a  rare  intermediary  who  maintains  proficiency  in  the 
use  of  more  than  five  systems;  for  end-users,  two  systems  is  high  (Ref.  10).  With  the  ongoing  prolifera¬ 
tion  of  databases,  it  is  obvious  that  both  end-users  and  intermediaries  will  benefit  frcm  an  interface  for 
searching  diverse  databases.  OTIC  found  that  end-user  and  intermediary  interface  needs  are  very  different 
when  considered  in  conjunction  with  today's  technology.  An  expansive  natural  language  interface  requiring 
artificial  intelligence  applications  appealed  to  both  populations,  but  could  not  be  accomplished  with 
existing  technology  in  the  short  term. 


A  dual  approach  was  adopted  Cor  the  interface  design,  incorporating  separate  strategies  for 
intermediaries  and  end-users.  Eight  database  systems  were  selected  for  inclusion  in  the  prototype.  These 
included  three  large  federal  systems  and  two  small  DoD  systems. 

A  software  survey  was  conducted  to  determine  if  interfaces  existed  which  met  DGIS  needs.  One  need 
that  influenced  the  result  of  this  survey  was  the  requirement  to  support  low-end,  "dimb"  terminals,  as 
well  as  intelligent  devices,  on  DGIS.  Many  of  the  software  packages  identified  were  designed  for  a  micro¬ 
computer  environment  and  had  to  he  ruled  out  for  use  in  the  prototype.  As  a  result  of  the  survey,  a 
decision  was  made  to  develop  an  interface  for  the  intermediary  and  to  integrate  an  existing  interface  for 
the  end-user. 

For  the  intermediary,  a  command  translator  is  being  developed  which  allows  the  user  to  interact  with 
any  of  the  test  systems  using  the  command  language  he  selects.  The  user,  for  example,  could  search 
NASA/REOCN  using  DROLS  commands  or  the  reverse.  Since  seme  commands  will  not  have  an  equivalent  in 
another  database,  native  command  searching  will  be  retained. 

To  satisfy  the  end-user,  the  EasyNet  database  searching  service  has  been  integrated  into  DGIS  under 
the  name  SearchMAESTRO.  SearchMAESTRO  is  a  menu-driven  database  front-end  which  provides  access  to  over 
900  commercial  databases.  This  service  was  tested  by  members  of  the  DoO  end-user  community  who  were 
delighted  with  the  simplicity  of  search  execution.  In  addition,  a  SearchMAESTRO  interface  to  the  OTIC 
Technical  Report  database  has  been  developed.  SearchMAESTRO  access  is  now  an  option  within  DGIS  and  is 
described  in  greater  detail  in  a  later  section  of  this  paper. 

Post-Processing 

Information  retrieved  from  databases  often  requires  analysis  or  post-processing  in  order  to  become 
useful  to  the  researcher.  DOE  recognized  this  need  and  developed  many  options  for  post-processing  data 
from  DOE/RECON  through  TIS.  A  library  of  post-processing  routines  for  numeric  and  bibliographic  data  was 
available  on  TIS  software  and  was  incorporated  in  the  DGIS  project  (Ref.  11).  In  order  to  post-process 
data,  the  user  downloads  it  into  a  file  on  DGIS,  translates  the  data  into  a  common  format,  and  calls  up 
one  of  several  available  post-processing  routines. 

OTIC  tested  existing  post-processing  capabilities,  developed  recommendations  for  enhancements,  and 
established  prioritites  for  the  expansion  of  the  capabilities  to  other  databases. 

DGIS  User  Interface  Design 

The  interface  incorporated  in  TIS  software  was  structured  to  support  users  with  a  knowledge  of  the 
UNIX  operation  system.  As  a  rule,  the  DGIS  user  ccnmunity  lacked  this  knowledge  and  was  not  inclined  to 
invest  the  time  required  to  acquire  UNIX  expertise.  However,  TIS  software  had  a  flexible  design  which 
allowed  the  user  interface  to  be  tailored  to  the  DGIS  user  community. 

The  DGIS  interface  incorporates  menu  and  command  modes  (Ref.  12).  The  objective  was  to  allow  the 
novice  user  to  interact  with  the  system  with  ease,  by  descending  through  a  series  of  menus.  For  the  more 
experienced  user,  commands  to  execute  systems  functions  were  incorporated.  The  main  menu  of  the  DGIS 
which  shows  the  overall  processes  of  DGIS,  is  as  follows: 


WELCOME  TO  THE  DoD  GATEWAY  INFORMATION  SYSTEM 

»»»»»INFORMATIO:i  TRANSFER  MODULES 

1  directory  DGIS  Directory  of  Online  Resources. 

2  communicate  Connect  to  Information  resources  and  people 

3  process  Information  product  tailoring. 

»»»»»!NFORMATION  UTILITIES 

4  em  Electronic  Mail. 

5  files  File  operations. 


»»»»»SUPPORT  INFORMATION 


help 

users 

info 

dticlog 


Descripfion  of  features. 
DGIS  registered  users 
DGIS  news  and  information 
DGIS  full  text  retrieval. 


DGIS  HOTLINE  NUMBER:  (703)  276-8182 

or  send  questions  via  DGIS  EM  to  'dgishelp.'  • 

Enter  a  menu  number,  a  command.  V  to  backup,  Y  for  top,  or  “e"  to  end. 


People  Bases  and  Databases 

The  goal  In  obtaining  information  is  to  acquire  knowledge.  Much  of  the  information  we  need  resides 
in  the  minds  of  human  experts.  Therefore,  the  DGIS  has  been  designed  to  allow  interaction  e*nong  people, 
hence  the  concept  of  people  bases  as  well  as  databases. 

Through  the  DGIS,  users  will  be  able  to  identify  and  ccnmur.icate  with  experts  and  colleagues  and  to 
connect  to  information  resources.  As  a  first  step,  we  have  focused  on  providing  the  technology  to  allow 
such  interaction.  The  DGIS  menu  for  ccnmuni  cat  ions,  shown  below,  illustrates  how  we  are  integrating 
databases  and  people  bases.  Users  can  access  a  search  interface,  a  database/system,  or  people. 


r  ■*  ^ . . . . ^ . 

2  COMMUNICATIONS 

DGIS  wnl  ajtcmar-caily  cornea  you  to  a  *ice  fang-?  of  remote 
information  systems  and  to  ether  people  online  the  DO  S  Per 
mtormatton  systems,  you  must  have  already  legSKreo  win 
these  systems  and  have  prodded  DGIS  year  access  passwords 
Select  as  to  I  lows 

>>>>>>>> >>SuA»V;H  INTERFACES 

2  1  r ' : £ : r o  SoarchMAESTRO  menu  dr.vr'i  search  a  a 

>'»»>»»NATIVE  MODE 


2  2 

multi 

access  mu :t; -type  informs:  on  systems 

2  3 

t  actual 

access  tactual  and  numeric  catatiases 

2  4 

media 

access  news  services 

>>>> 

>>>>  »>QTHtR  COM 

MUSCAT  IONS 

2  5 

dial 

unassisted  dial  -mo  ether  systems 

2  6 

people 

communicate  interactively  with  DGIS  users 

|  Enter  a  menu  number,  a  command  ”b"  to  cac«up.  T  tor  top  cr  ‘e’  to  end. 

V _  _ ) 

Accessing  a  datahase/systom  is  accomplished  usinq  the  QJNNECT  command  ot  the  DIAL  command.  The 
CONNECT  command  provides  users  with  automatic  access  to  information  resources.  Users  do  not  have  to  know 
telephone  numbers,  Defense  Data  Network  (DON)  locations,  passwords,  access  protocols,  or  1  jt  protocols. 

The  user  enters  the  CONNECT  command  and  a  data  resource  name.  DGIS  then  establishes  a  connection  to  the 
resource  when  the  connection  is  made,  DGIS  logs  the  user  in.  DGIS  uses  TYMNET,  TELEHET,  DDN,  and 
commercial  telephone  lines  to  establish  connections. 

The  CONNECT  command  can  be  used  to  access  information  centers  worldwide.  To  be  eligible  to  use  the 
CONNECT  command  to  access  a  resource,  a  DGIS  user  establishes  an  account  with  that  resource  and  obtains 
the  required  access  identification  information,  such  as  passwords,  whicn  is  then  programmed  into  the  gate¬ 
way  by  the  DGIS  Database  Administrator.  The  billing  process  between  user  and  resource  is  unaffected  by 
gateway  access.  Vendors  maintain  the  same  hillinq  structure  and  users  maintain  the  same  reimbursement 
structure,  regardless  of  the  access  procedures.  iJIIS  has  several  levels  of  security  to  ensure  that 
password  integrity  is  not  violated. 

As  the  number  of  resources  DGIS  connects  to  increases,  we  plan  to  have  DGIS  accounts  that  will 
alleviate  the  need  for  users  to  establish  their  own  accounts  with  each  vendor.  To  this  end  we  are  row 
developing  a  charge-back  mechanism  so  that  we  can  bill  users  for  their  use  of  these  general  accounts. 

Users  who  wish  to  access  a  resource  that  does  not  have  an  automatic  connection  routine  can  take 
advantage  of  the  DIAL  command,  rather  than  the  CONNECT  command.  DIAL  allows  users  to  call  any  information 
center,  computer,  or  terminal,  no  matter  where  the  location.  To  use  DIAL,  the  user  must  know  the  neces¬ 
sary  passwords  and  telephone  numbers.  DIAL  allows  the  user  to  access  an  off-network  facility  while 
retaining  DGIS  capabilities  such  as  downloading  and  file  transfer. 

Once  connected  to  a  resource  through  DGIS,  a  user  can  download  data  from  that  resource  to  DGIS. 
Downloading  data  opens  many  options.  For  example,  data  can  be  reviewed  at  leisure,  merged  with  other 
data,  and  shared  with  other  users  by  allowing  them  to  access  the  file.  Data  files  can  also  be  transferred 
to  other  users  so  that  they  can  manipulate  the  data  to  suit  their  own  needs.  DGIS  allows  data  to  be 
shared  selectively  on  a  worldwide  basis.  The  user  is  responsible  for  ensuring  that  oopyriqhr  laws  are  not 
violated. 

Several  mechanisms  are  available  for  interconnecting  people.  An  electronic  mail  service  is  available 
twenty-four  hours  a  day.  Standard  electronic  mail  features  such  as  send,  receive,  answer,  and  forward  are 
incorporated.  Mail  messages  can  be  sent  simultaneously  to  multiple  addresses  and  to  every  member  of  pre- 
established  mail  groups,  with  lengthy  documents  attached  if  needed.  Users  recognize  the  benefits  of  being 
able  to  communicate  with  numbers  of  people  at  the  same  time  and  of  avoiding  the  "telephone-tag"  routine. 
Messages  can  be  filed  for  future  reference  or  deleted  from  the  system  upon  command. 

In  addition,  electronic  mail  can  be  used  to  send  information  downloaded  from  a  database  and  placed  in 
a  file.  A  user  who  does  not  want  to  do  his  own  database  searching  can  send  a  search  request  to  an  infor¬ 
mation  specialist  without  leaving  his  desk.  The  information  specialist  can  perform  the  search,  download 
the  results  to  a  file,  and  send  the  file  to  the  user.  Since  the  data  is  stored  in  a  file,  both  the  infor¬ 
mation  specialist  and  tne  end-user  can  use  post-processing  routines  for  manipulating  and  analyzing  the 
data.  The  user  can  share  the  file  with  colleagues  by  using  the  electronic  mail  feature. 

WHITE  is  another  communications  option  which  allows  users  online  to  communicate  with  each  other  via 
their  terminals.  To  establish  a  WRITE  session,  the  user  first  enters  the  command  %WHO  to  get  a  display 
list  of  the  users  currently  online.  The  user  then  enters  the  command  %WRITE,  followed  by  the  name  of  the 
user  with  wham  he  wishes  to  communicate.  The  WRITE  command  notifies  the  party  being  called,  who  then  has 
the  option  of  responding.  The  command  TALK  is  a  variation  of  the  WRITE  command  which  provides  a  split 
screen  so  that  both  parties  can  input  at  the  same  time.  The  WRITE  and  TALK  commands  are  only  useful,  of 
course,  when  parties  who  want  to  communicate  are  logged  on,  by  chance  or  arrangement,  at  the  same  time. 

The  LINK  command  allows  two  or  more  users  at  different  locations  to  link  their  terminals  so  that  they 
view  the  same  data  display.  All  users  have  control  over  the  display  and  can  issue  commands  at  will.  Of 
course,  linking  necessitates  a  cooperative  spirit  and  some  coordination. 


Through  the  LINK  command  information  specialists  and  end-users  can  together  perform  interactive 
database  searches.  The  end-user  benefits  frcnt  the  specialist's  expertise,  while  the  specialist  benefits 
from  the  end-user's  immediate  feedback. 

The  LINK  command  can  prove  advantageous  ir.  many  other  situations.  For  example,  an  instructor  can 
provide  online  tutorials  to  a  student  or  a  class  at  a  different  location.  This  technique  was  used  by 
Dr.  Sullivan  of  the  Chemical  Information  System  (CIS),  in  Washington,  DC,  t-  provide  a  demonstration  of 
CIS  to  a  class  in  Brasil.  The  use  of  a  speaker  phone  enhanced  this  demonstration  by  providing 
simultaneous  voice  ocnmunication. 

The  LINK  command  is  also  useful  for  joint  online  editing  or  reviewing  of  reports  among  experts.  This 
practice  eliminates  mail  delays  and  allows  experts  to  discuss  changes  while  viewing  the  data  together. 

User  Support 

As  implementation  and  testing  of  the  various  DGIS  modules  began,  it  became  obvious  that  seme  form  of 
user  support  and  training  was  required  to  ensure  the  success  of  the  system.  A  Gateway  User  Support  arri 
Training  Office  (GUSTO)  was  established  to  satisfy  this  need  (Ref.  13).  GUSTO  provides  a  hotline  service 
which  users  can  call  when  they  have  a  problem.  GUSTO  staff  will  identify  the  source  of  the  problem  (i.e. , 
the  gateway,  the  user's  terminal,  a  teleccmuni  cat  ions  link,  a  remote  system,  etc.)  and  take  action  to 
have  the  problem  resolved.  Users  may  also  contact  GUSTO  staff  using  the  electronic  mail  capability 
available  on  DGIS. 

Developing  user's  manuals  and  providing  training  courses  are  also  GUSTO  responsibilities.  The 
training  course  is  primarily  designed  for  the  professional  searcher  who  wants  to  exercise  the  full  power 
of  the  system,  especially  in  the  area  of  post-processing  of  bibliographic  data.  The  user's  manual  serves 
as  a  reference  tool  for  the  user.  GUSTO  staff  also  poll  the  DGIS  user  community  as  needed  to  identify  new 
system  requirements. 

Prototype  and  Beyond 

A  prototype  DGIS  has  been  developed  and  is  currently  undergoing  test  and  evaluation  within  the  DoD 
community.  There  are  currently  150  users  testing  the  system.  The  DGIS  prototype  is  running  on  a  VAX 
11/780  using  the  UNIX  operating  system,  the  INGRES  database  management  system,  and  the  PROLOG  interpreter 
package.  The  DGIS  software  is  being  ported  to  a  Pyramid  98X,  an  Elxsi  6800,  a  Gould  6050,  and  Sun  Work¬ 
stations  for  benchmarking  and  performance  evaluation.  Based  on  the  results  of  the  performance  evaluation, 
a  hardware  configuration  for  a  production  system  will  be  acquired.  The  production  configuration  may  con¬ 
sist  of  several  machines  networked  together.  For  exanple,  the  common  command  language  and  post-processing 
routines  could  be  isolated  on  a  backend  machine.  DTIC  plans  to  stabilize  a  version  of  the  DGIS  and  offer 
it  as  a  standard  DTIC  service  in  October  1988.  Prototype  development  will  continue  on  a  separate 
development  machine,  and  enhanced  versions  of  the  DGIS  will  be  made  operational  at  selected  intervals. 

The  DGIS  was  developed  in  prototype  as  an  unclassified,  minioemputer-based  centralized  gateway 
system.  As  we  move  this  version  into  operation,  we  plan  to  begin  development  of  a  distributed,  clustered 
gateway  network.  Gateway  nodes  in  the  network  would  be  made  up  of  centralized,  multi-user  configurations 
and  intelligent  workstations  distributed  to  users.  A  selection  of  gateway  capabilities  would  be  available 
on  personal  workstations.  It  makes  sense,  for  example,  to  have  automated  connection  routines,  cannon 
command  languages,  and  post-processing  routines  for  frequently-used  databases  available  on  a  personal 
workstation.  The  centralized,  mini-based  node  would  be  utilized  to  identify  and  search  infrequently-used 
resources  and  for  post-processing  volumes  of  data  which  are  beyond  the  capacity  of  the  personal  work¬ 
station.  Clustered  gateway  nodes  consisting  of  a  centralized  gateway  processor  and  personel  workstations 
could  be  based  on  geographic,  organizational,  or  subject  boundaries.  The  clustered  gateway  nodes  would 
interconnect  and  route  users  to  appropriate  nodes  when  necessary.  Development  of  a  classified  gateway 
system  is  also  underway. 

Integrated  Bibliographic  Information  System  (IBIS) 

Development  and  implementation  of  the  DGIS  allows  OTIC  users  to  connect  to,  and  search  and  analyze 
data  retrieved  from  diverse  unclassified  database  services  in  the  federal,  commercial,  and  international 
sectors.  Development,  as  planned,  of  a  classified  version  of  the  DGIS  will  make  its  reach  almost  limit¬ 
less.  But  what  the  DGIS  has  not  provided  is  tools  for  the  development  of  local  databases  or  catalogs  of 
holdings  and  tools  for  local  collection  management. 

The  DGIS  was  designed  as  an  intelligent  switching  mechanism.  The  resources  targeted  by  the  DGIS  were 
already  online.  DGIS  was  designed  only  to  switch  users  to  automated  resources.  The  Directory  of  Resour¬ 
ces  is  the  only  database  central  to  and  created  and  maintained  on  the  DGIS.  This  is  a  basic  design  philo¬ 
sophy,  not  to  be  altered  for  fear  of  deflecting  the  DGIS  from  its  primary  focus  —  that  of  being  a 
gateway. 

But  there  was  a  need  to  provide  a  vehicle  to  automate  and  manage  local  information  collections  which 
are  manually  maintained,  very  valuable,  and  very  difficult  for  non-local  personnel  to  use.  This  need  is 
acutely  felt  by  the  DoD  library  community,  a  key  component  of  STINET. 

Therefore,  in  1983,  OTIC  initiated  development  of  a  library  automation  system  responsive  to  the 
networking  and  local  collection  management  needs  of  DoD  libraries.  The  system  would  support  centralized 
resource  sharing  while  allowing  local  processing  flexibility.  The  objective  was  to  permit  DcD  libraries 
to  make  maximum  use  of  existing  information,  organize  this  information  to  meet  local  needs,  and  selective¬ 
ly  share  newly-generated  information  with  other  members  of  the  community.  The  system  designed  to  accom¬ 
plish  this  would  have  to  integrate  local  control  for  local  collection  management  functions  (reference, 
cataloging,  and  circulation)  with  access  to  the  external  resources  required  for  reference,  shared 
cataloging,  and  other  network  requirements. 


The  automation  needs  of  the  DoD  libraries  were  defined  through  a  requirements  study  initiated  in  1983, 
which  included  surveys,  site  visits,  and  staff  interviews  throughout  the  DoD  technical  library  comnunity 
(Ref.  14).  Based  on  this  study,  a  prototype  system  to  meet  these  requirements  was  specified,  and  its 
development  becane  the  objective  of  the  IBIS  project.  The  requirements  that  IBIS  is  expected  to  satisfy 
are  stumarized  as  follows. 

Local  Collection  Management:  Local  cataloging,  retrieval,  and  circulation  capabilities  are  essential 
IBIS  system  requirements.  Acquisition  and  serials  management  functions  are  desirable  system  features; 
they  will  be  added  at  a  later  date. 

External  Database  Access:  System  capabilities  to  input  data  to  and  retrieve  data  from  external 
databases  are  critical.  Uploading  and  downloading  capabilities  are  essential.  The  DTIC  TR  database  is  a 
primary  external  resource;  OTIC  TR  access  by  IBIS  is  vital  to  the  cataloging  and  reference  functions  of 
the  libraries. 

Integration  of  Local  Collection  Management  and  External  Database  Access  Capabilities:  These 
capabilities  are  to  be  integrated  on  one  computer  and  accessible  by  an  authorized  user  frcm  a  terminal 
equipped  with  video  display  screen. 

Ccmmon  Command  Set  for  Performing  Functions  Locally  or  Externally:  A  single  command  language  is 
necessary  for  users.  The  IBIS  will  perform  the  necessary  protocol  translations  between  the  single  command 
language  internal  to  IBIS  and  the  diverse  command  structures  of  the  external  databases.  The  oorrmon  com¬ 
mand  set  will  relieve  the  user  of  the  need  to  learn  and  master  separate  languages  and  procedures  for  each 
database  accessed.  However,  "native"  language  access  to  external  databases  must  be  available  to  the 
user. 

Simultaneous  Access  to  External  Data  Sources  and  the  Local  Catalog  for  Reference  Searching:  The 
libraries  are  to  be  able  to  run  the  same  search  query  against  multiple  databases,  local  and  external 
simultaneously.  Search  results  are  to  be  delivered  to  a  single  terminal. 

Post-Processing  of  Retrieved  Data:  The  ability  to  reformat,  merge,  and  sort  data  downloaded  from 
external  sources  is  a  desirable  feature.  This  capability  will  allow  libraries  to  fit  search  results  — 
derived  frcm  external  sources  and  the  local  catalog  —  to  their  patron's  needs  and  deliver  a  single 
product  in  an  economical  and  efficient  manner. 

Flexible  Local  Catalog  Format:  The  IBIS  format  must  be  flexible  to  accommodate  the  diverse  local 
catalog  formats  used  throughout  the  DoD  community.  A  flexible  local  catalog  format  will  encourage  wide 
IBIS  implementation.  This  approach  avoids  the  delays  and  extraordinary  expense  that  would  be  required  to 
resolve  the  issue  of  cataloging  standardization  within  the  DcD. 

Machine-Aided  Citation  Translation  and  Uploading  to  DTIC:  The  IBIS  must  assist  in  translating  biblio¬ 
graphic  citations  frcm  the  local  file  format  into  the  OTIC  format.  This  capability  will  allow  DoD  librar¬ 
ies  to  contribute  data  to  the  OTIC  TR  database  directly  and  efficiently.  This  information  will  then  be 
available  to  the  entire  DoD  community  for  display  and  downloading.  The  resulting  shared  cataloging  will 
contribute  significantly  to  meeting  the  resource  sharing  goals  set  by  DoD. 

Patron  Access:  IBIS  will  verify  a  patron's  right  to  access  before  releasing  information  classified  as 
Defense-sensitive  or  otherwise  restricted.  In  addition,  the  IBIS  will  provide  for  online  communication, 
as  allowable  by  local  security  regulations,  between  patrons  and  the  library  staff  and  among  patrons/ 
colleagues  themselves.  In  some  instances,  this  will  require  that  IBIS  be  connected  with  a  local  area 
network. 

From  Concept  to  Prototype.  The  design  concept  for  the  IBIS  was  formulated  from  the  foregoing 
requirements.  A  software  survey  was  conducted  to  determine  if  commercially  available  software  packages 
could  provide  the  capabilities  required  to  implement  this  concept  directly  (Ref.  15).  A  list  of  30  crit¬ 
ical  software  functions  was  compiled  and  used  to  conduct  the  survey.  Reference  sources,  such  as  the  Data- 
Pro  Psports  and  the  Library  Systems  Evaluation  Guide,  were  used  to  identify  software  packages  specifically 
intended  for  application  in  the  library  environment.  A  total  of  66  potentially  suitable  packages  were 
identified,  and  their  vendors  were  asked  to  respond  to  the  survey.  Analysis  of  the  responses  showed  that 
three  of  the  survey  questions  were  key.  They  were: 

o  Does  your  package  support  online: 

-  Cataloging  (to  include  online  catalog  updating;? 

-  Reference/Catalog  search  and  retrieval? 

-  Circulation  management  and  control? 

-  Serials  management? 

-  Acquisition  management? 

-  Others? 

o  Does  your  package  have  a  gateway  capability  that  includes: 

-  Queries  of  "second"  external  database  (heterogeneous)? 

-  Downloading/retrieval? 

-  Uploading/cataloging? 

-  post-processing? 

-  Telecommunications  link,  auto-dialing,  log-on? 

-  Single  query  language? 

o  Does  your  package  preclude  running  other  applications  not  part  of  your  package? 

The  answers  to  the  first  two  questions  eliminated  the  notion  that  a  "perfect"  software  system  was 
available.  No  single  ccnmercial  or  public-domain  system  provided  the  capabilities  required  to  implement 
the  IBIS  concept.  Integrated  library  systems  that  supported  cataloging  and  reference  functions  existed, 
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as  did  gateway  systems  that  supported  the  querying  of  external  databases,  uploading,  downloading,  and 
simultaneous  searching,  but  no  software  combined  these  functions.  In  addition,  no  software  package  oould 
be  identified  that  contained  a  suitable  common  retrieval  language,  or  one  which  the  vendor  was  willing  to 
adapt  to  satisfy  the  database  access  requirements  of  the  DoD  library  catmunity. 

It  was  the  responses  to  the  third  question  which  gave  hope  that  a  tractable  solution  was  possible. 

The  approach  chosen  was  to  combine  an  integrated  library  system  for  local  library  collection  management 
with  a  gateway  system  for  accessing  external  resources.  Hie  intelligent  gateway  processor  embedded  in  the 
DGIS  was  designated  as  the  gateway  portion  of  the  IBIS.  It  would  support  access  to  external  resources, 
data  downloading  and  uploading,  simultaneous  searching,  and  post-processing.  The  common  command  language 
which  OTIC  was  developing  for  DGIS  would  be  made  available  for  IBIS  also. 

Of  the  66  library  system  packages  identified,  six  were  determined  to  have  the  requisite  features. 
These  six  packages  were  benchmarked,  and  a  selection  was  made  in  August  1985.  At  that  time,  the 
integration  of  the  selected  packages  with  the  gateway  software  begem. 

The  Defense  Nuclear  Agency  (DMA)  was  selected  as  the  prototype  site  for  the  IBIS.  The  DMA  library, 
like  many  of  the  libraries  surveyed  during  the  requirements  definition  phase,  had  a  collection  of  more 
than  100,000  holdings  and  required  a  system  supporting  multiple  users.  The  IBIS  prototype  was  implemented 
at  ENA  on  a  VAX  11/750  minicomputer. 

Scaling  Down.  In  the  initial  stages  of  the  IBIS  project,  it  was  believed  that  the  production  system 
design  selected  as  a  result  of  the  DNA  prototype  test  would  be  suitable  for  implementations  throughout 
DoD.  However,  it  was  determined  during  the  software  survey  that  most  library  packages  are  designed  to 
operate  efficiently  over  relatively  narrow  ranges  of  collection  sizes  arri  transaction  volumes.  The  hard¬ 
ware  configurations  selected  by  the  various  vendors  to  run  their  different  packages  reflected  this  tend¬ 
ency.  It  then  became  evident  that  a  distinct  IBIS  configuration  to  support  smaller  DoD  libraries  would  be 
needed. 

The  hardware  configuration  required  to  support  the  prototype  IBIS  at  DNA,  a  miniccmputer-based  system, 
was  not  economical  for  smaller  DoD  technical  libraries  with  collection  sizes  ranging  from  5,000  to  75,000 
items.  A  microcomputer-based  system  was  more  appropriate  for  the  lower  transaction  volumes  and  smaller 
operating  budgets  associated  with  these  libraries.  Therefore,  OTIC  initiated  an  effort  to  identify  and 
isolate  the  special  requirements  of  these  smaller  DoD  libraries  and  accommodate  them.  A  software  survey 
was  performed  to  identify  packages  which  were  suitable  for  servicing  these  smaller  libraries  and  at  the 
same  time  were  compatible  with  the  gateway  software.  The  U.S.  Army  Training  and  Doctrine  Command  (TRADOC) 
library  was  selected  as  the  prototype  site  for  the  microccmputer-based  version  of  the  IBIS. 

Future  Plans.  The  results  and  experience  gained  during  the  test  and  evaluation  of  the  DNA  and  TRADOC 
prototypes  will  be  used  to  develop  the  specifications  for  a  competitive  procurement  of  a  production  system 
from  a  commercial  source.  The  production  system  will  be  available  for  purchase  and  installation  by  any 
library  on  the  STINET.  As  a  result  of  the  dual  approach  —  small  and  large  libraries  —  an  IBIS  product 
line  which  can  meet  the  needs  of  any  DoD  library,  regardless  of  collection  size  and  transaction  volume,  is 
expected  to  result.  This  product  line  is  expected  to  be  available  to  the  DoD  user  community  in  the  summer 
of  1988. 

The  IBIS  is  the  first  of  OTIC's  Local  Automation  Models  and  will  make  network-compatible  software 
available  for  local  installation  and  use.  The  IBIS  is  tailored  for  bibliographic  information,  arri  its 
target  community  is  DoD  libraries.  Later,  Local  Automation  Model  product  lines  can  be  tailored  for  other 
user  carmunities  and  different  types  of  data.  Tor  example,  models  tailored  for  numeric  data  may  be  a 
requirement  for  sane  communities.  Depending  on  the  software  selected  for  the  production  IBIS  and  the  non- 
bibliographic  user  requirements  identified,  it  may  be  possible  to  modify  the  IBIS  software  to  meet  addi¬ 
tional  needs.  If  not,  DTIC  can  develop  additional  network-compatible  local  automation  models  specifically 
for  an  identified  defense  requirement. 

Successful  development  and  deployment  of  the  DGIS  and  IBIS  will  provide  OTIC  with  a  powerful  product/ 
service  line  for  the  STINET.  Through  STINET,  users  would  have  a  mechanism  for  interconnecting  with  a 
virtually  endless  range  of  information  systems,  computing  resources,  and  people.  They  would  also  have  a 
network-compatible  vehicle  for  automating  and  managing  local  information,  selectively  sharing  that 
information  with  other  members  of  the  network,  and  analyzing  information  from  local  and  remote  resources. 

SearchMAESTRO 


Both  DGIS  and  IBIS  were  designed  to  deliver  the  power  and  utility  required  by  a  broad  section  of 
OTIC's  market.  But  part  of  OTIC's  market  did  not  require  the  full  power‘of  the  DGIS  and  the  IBIS.  This 
segment  consisted  of  end-users  who  wanted  to  do  some  of  their  own  information  gathering.  The  users  in 
this  market  segment  had  the  requirement  to  scan  literature  and  locate  relevant  items  in  their  area  of 
interest.  They  needed  an  interface  that  would  provide  easy  access  to  a  variety  of  databases.  In  seme 
cases,  the  users  would  not  be  familiar  with  existing  databases,  so  they  required  a  system  that  would 
select  a  database  for  them  and  guide  them  through  the  search  process.  The  information  needs  of  these 
users  tended  towards  fact  retrieval  —  such  as  the  latest  production  statistics  for  a  manufacturing 
company  —  or  scanning  for  relevant  items  —  such  as  what  newspaper  articles  have  been  written  on  a 
particular  subject.  Their  need  was  for  sane  information  relevant  to  the  subject  rather  than  an  exhaustive 
search  of  the  subject.  We  refer  to  them  as  casual  end-users. 

For  this  segment  of  the  conmunity,  OTIC  introduced  SearchMAESTRO.  As  mentioned  earlier,  Search- 
MAESTRD  provides  access  to  over  900  databases.  SearchMAESTRO  can  be  used  directly  or  through  the  DGIS. 
Users  who  access  SearchMAESTRO  directly  can  access  databases  via  a  simple-to-use  interface.  This  inter¬ 
face  eliminates  the  need  to  learn  unique  database  command  languages  and  search  techniques.  The  user  has 
two  modes  of  operation  from  which  to  choose.  In  the  first  mode,  SearchMAESTRO  leads  the  user  through  a 
series  of  questions  and  answers  and  selects  the  database  for  him.  An  example  of  how  this  works  is: 
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Password  accaptad 

Almost  Thar*.  B«  ptcunt. 
Finally  aon« .  Its  *douc  tn 


1  Saa rchHAESTRO  -  1 

2  SaarchMAESTAO  -  IX 

3  Dicion*  ditictocy 


wa  pick  Ch«  database 
lou  pick  cn«  database 


1  hom ,  Business.  Educational  use 

2  Research  a  Technical  information 
1  Telecommunication# 

H  Help 
->  2 


3  Place 

4  Organisation 

5  Government  Technical  Reports 
H  Help 


1  Current  Events 

2  Business,  Economics 

3  Computers .  Sci/Tecn,  Medicine 

4  Law,  Trademark*.  Patents 

5  Social  Sciences,  Education 

6  Art.  Literature,  Entertainment 
^  Religion,  Philosophy 

h  Help 


1  Research  end  popular  magetines 

2  General  periodicals 

3  Hagasinea  (rull-Text) 

4  Newsletters  on  electronics 

5  Books  on  computers 
(  Encyclopedias 

7  other  choices 
H  Kelp 


1  basic  searching 

2  field  searching 
H  Kelp 

->  1 


Agriculture 

Biology 

Computet,  engineering,  technology 
Earth  sciences,  energy 
Mathematics#  physics 
Medieina.  Allied  Health 
Help 


This  database  cacriea  a  surcharge 
Do  you  wish  to  continue  ?  (Yes/Noi 


VOICE  AMD  DIGIT/  TRAMSMI/ 


Enter  your  specific  topic  te.g..  VOICE  AMD  DIuiT/ 

(LIGHT/  OR  OPTIC/)  AMD  DATA  PROCESS/  WORDSTAR  OR  WORD  STAR 

PUNCTUATION.  Type  H  for  Search  Guidelines. 

(type  H  for  important  examples 
or  B  to  back  up) 

- >  robotics  and  expert  systems 


System  is  now  searching  the  Computer  Database,  copyrighted  1981  by  Inf  or-* 
nation  Access  Company,  Belmont,  CA.  and  available  through  Dialog  Inlormat ion 
Services.  Inc 


In  the  second  mode,  the  user  can  select  the  database  he  wants  and  use  the  SearchMAESTRD  interface  for 
executing  the  search.  With  either  mode,  the  user  can  view  search  results  on  the  screen,  or  print  or  save 
the  results  using  local  equipment.  The  main  reason  for  accessing  SearchMAESTRD  through  the  DGIS  is  to 
take  advantage  of  the  post-processing  and  electronic  mail  utilities.  The  users  must  always  observe  fair 
use  practices  when  dealing  with  copyrighted  material. 

The  unclassified  portions  of  the  Technical  Reports  and  the  Work  Unit  Information  System  files  of 
DTIC's  Defense  Research  Development,  Test  and  Evaluation  Online  System  (DRDLS)  are  being  made  available  to 
registered  users  through  SearchMAESTRD. 

Any  time  man  and  computer  meet,  a  diversity  of  problems  arise  which  can  best  be  handled  through  hunan 
intervention.  Therefore,  SearchMAESTRD  provides  online  user  assistance  through  a  function  called  "SOS" 
for  "Save  Our  Search."  At  any  point  during  a  SearchMAESTRD  session,  the  user  can  simply  enter  "SOS"  and 
a  search  expert  will  respond.  The  search  experts  are  trained  to  interpret  reference  questions,  be 
knowledgeable  about  available  sources  of  online  information,  and  know  how  SearchMAESTRD  works.  DTIC 
supplies  the  search  experts  for  the  DRDLS  files,  and  Telebase  Systems,  Inc.,  supplies  experts  for  all 
other  systems. 

SearchMAESTRD  is  currently  available  as  a  prototype  to  approximately  30  users.  These  users  access 
SearchMAESTRD  both  directly  and  through  the  DGIS.  The  SearchMAESTRD  service  will  be  offered  as  a  DTIC 
production  service  in  October  1987. 

SearchMAESTRD  is  just  the  first  of  several  end-user  interface  options  we  plan  to  offer  our  users. 

DTIC  has  sponsored  two  conferences  in  the  area  of  interface  technology  and  maintains  a  constant  alert  for 
new  offerings  (Ref.  16,  Ref.  17).  We  are  especially  interested  in  expert  system  and  personal  computer 
based  interfaces. 

Case  Conclusions 

Implementation  of  the  prototypes  described  in  this  case  study  demonstrates  the  feasibility  of 
integrating  diverse,  yet  functionally  compatible,  automation  and  information  resources  within  an  inter¬ 
operable  network.  It  also  demonstrates  that  a  variety  of  technologies  exist  today  —  software,  hardware, 
and  teleoonmunications  —  which  can  be  blended  and  modified  to  make  using  information  retrieval  systems 
easier  and  the  results  more  meaningful.  But  technology  alone  is  not  the  answer.  Advances  in  information 


retrieval  systems  are  dependent  on  people  —  end-users,  intermediaries,  vendors,  producers  —  working 
together  and  willing  to  modify  and  blend  their  skills  in  new  ways. 

OTIC  is  planning  for  the  evolution  of  the  STINET  from  prototype  applications  to  a  fully  functional, 
integrated  network.  It  has  been  said  that  "the  best  plan  is  only  a  plan  ...  unless  it  degenerates  into 
work"  (Ref.  18).  OTIC  is  positioning  itself  to  take  advantage  of  evolving  technologies  to  forward  STINET 
development,  but  there  are  no  quick  technological  solutions  in  STINET.  OTIC  must  work  to  build-on, 
tailor,  and  accelerate  the  introduction  of  new  technology.  At  the  sane  time,  we  must  work  to  effect 
changes  in  hunan  processes  so  that  the  fullest  benefit  can  be  reaped  from  application  of  new  information 
technologies.  The  STINET  plan  has  degenerated  into  workl  The  STINET  work  is  of  a  truly  synergistic 
nature  in  that  the  whole  is  much  greater  than  the  individual  technologies,  and  STINET  will  only  come  into 
being  through  the  combined  efforts  of  many  people. 

FORECAST 

Information  leads  to  knowledge,  and  knowledge  has  always  meant  power,  wealth,  and  control.  Technolo¬ 
gy  is  reshaping  the  ways  we  communicate,  distribute,  and  manage  information.  It  works  to  bring  informa¬ 
tion  closer  to  its  users.  Introduction  of  technologies  such  as  personal  computers  and  user  friendly  search 
tools  is  creating  a  massive  new  market  for  information  retrieval  systems.  During  the  next  five  years, 
there  will  be  a  restructuring  of  the  products  and  a  repositioning  of  the  players  in  the  information 
retrieved  market. 

Eeisy-to-use  interfaces  are  exposing  new  users  to  the  wealth  of  information  available  through 
information  retrieval  systems.  As  they  earn  learn  to  use  its  power,  the  perceived  value  of  information 
will  grow.  As  the  amount  of  information  continues  to  explode,  so  will  the  demand  for  information  tailored 
to  user  needs.  Gateway  systems  will  evolve  which  present  users  with  "views"  of  information  personalized 
to  meet  their  needs.  Information  "boutiques,"  organized  by  subject  and  interconnected,  will  form  a 
virtual  world-wide,  multi-media  library. 

Users  want  answers!  This  will  be  the  overriding  challenge  facing  the  information  retrieval  market. 
Decision  support  systems  that  incorporate  current  knowledge  and  can  aid  users  in  adding  to  that  knowledge 
base  will  be  the  goal.  As  the  generation  of  information  continues  unimpeded,  users  will  demand  sophist¬ 
icated  information  analysis  tools  and  instantaneous  fact  retrieval.  This  will  dictate  that  high-value 
information  be  available  in  digital  form  so  that  users  can  rapidly  access  facts  and  analyze  data. 

Local  systems  for  creating,  storing  and  processing  information  will  become  a  vital  part  of  the 
information  retrieval  market.  Selected  information  will  be  distributed  on  optical  media  for  local  use. 
Information  on  optical  media  will  often  be  part  of  a  complementary  product  line  which  includes  online 
services.  Users  will  be  connected  to  local  area  networks  that  will  encourage  information  sharing.  This 
will  swell  the  information  flow  and  make  it  even  more  difficult  to  pick  out  facts  or  trends.  It  will  also 
increase  the  need  for  information  specialists  and  experts. 

Information  specialists  will  be  the  gurus  of  the  new  age.  They  will  provide  expertise  in  the 
politics  of  information  and  will  be  invaluable  corporate  resources.  They  will  have  to  respond  to  users' 
critical  needs  for  fast,  accurate,  complete  information.  They  must  be  throughly  knowledgeable  about  the 
sources  of  information,  both  corporate  and  external.  They  must  understand  emerging  information  technology 
well  enough  to  offer  a  range  of  solutions  to  meet  the  needs  of  their  clientele.  This  may  include  building 
"views"  of  information  to  meet  individual  needs,  tailoring  interfaces  to  allcw  clientele  to  retrieve 
information  from  their  "views"  easily,  and  performing  ccrprehensive  searches  for  clients  when  exhaustive 
recall  is  essential.  Most  importantly,  information  specialists  will  be  called  upon  to  attest  to  the 
validity  and  reliability  of  data  sources.  They  will  have  to  ensure  that  external  resources  they  use,  or 
recommend  to  their  clients,  are  credible.  They  will  have  to  develop  policies  concerning  how  locally 
generated  information  will  be  controlled  and  validated.  This  will  be  critical;  the  use  of  incorrect 
information  for  decision  making  and  planning  could  prove  disasterous  to  an  individual  or  to  an 
institution. 

The  evolution  of  optical  disk  storage,  computers  and  interface  software  provides  the  opportunity  for 
rapid  advances  in  information  retrieval  systems,  but  we  are  still  a  long  way  from  having  the  world  of 
information  on  a  microchip  at  our  fingertips.  The  information  market  will  evolve  in  that  direction  as 
end-users,  information  specialists,  information  producers  and  vendors  continue  their  partnership  in  the 
quest  for  knowledge. 
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Atlantic  Richfield  has  been  successfully  operating  an 
interactive,  full-motion  video  teleconferencing  system 
since  September  of  1983.  The  system,  named  ARCOvision, 
is  for  internal  use  only  and  consists  of  six  locations 
in  the  United  States  including  Alaska.  The  implementation, 
capabilities,  equipment,  and  costs  related  to  the  ARCOvision 
system  will  be  discussed  in  this  paper. 


In  September  of  1983  Atlantic  Richfield  introduced  a  live  full-motion  video- 
conferencing  system  to  employees  in  Anchorage,  Alaska;  Dallas  and  Houston,  Texas;  Los 
Angeles,  California;  Philadelphia,  Pennsylvania  and  Washington,  D.  C.  Using  its  own 
satellite-based  communications  network  to  transmit  full-motion  video  images  between  company 
headquarters,  the  meeting  facilities  allow  ARCO  employees  the  opportunity  to  meet  their 
distant  colleagues  face-to-face  without  leaving  their  office  building.  The  system  has 
been  named  ARCOvision  allows  participants  to  view  color  life-size  images  of  all  their 
counterparts  at  one  time  -  as  if  they  were  sitting  across  the  conference  table  rather  than 
across  the  country. 

The  ARCOvision  system  was  made  possible  by  an  enhancement  of  the  ARCOnet  system, 
the  backbone  of  ARCO's  communication  system.  ARCOnet  is  one  of  the  largest  privately 
owned  and  operated  telecommunications  systems.  As  part  of  the  ARCOnet  enhancement  we 
participated  in  the  building  of  four  and  have  leased  two  satellite  earth  stations.  We 
have  leased  a  full  transponder  on  Alascom's  Aurora  satellite.  The  system  is  completely 
digital  and  uses  one  of  the  most  sophisticated  multiplexing  and  polling  techniques 
developed.  It  supports  all  of  our  voice,  data,  and  video  transmissions.  The  all-digital 
system  gave  us  capabilities  that  were  never  economical  before.  The  digital  system  gives 
us  the  ability  to  transmit  video  through  very  sophisticated  compression  algorithms 
and  hardware. 

The  first  question  often  asked  is  "Why  do  we  use  videoconferencing?”  In  our  case, 
ARCOvision  came  about  because  our  former  Chairman  of  the  Board,  Mr.  R.  O.  Anderson,  told 
Business  Week  that  there  had  to  be  a  better  way  for  our  business  to  communicate.  In  1980 
he  stated  that  ARCO  would  lead  the  way  with  sophisticated  videoconferencing.  In  order  to 
give  you  an  idea  of  why  we  went  into  this  some  facts  are  relevant;  In  1980,  the  year  we 
had  several  tens  of  thousands  employees,  ARCO  spent  some  60  million  dollars  on  travel  and 
had  more  than  80  thousand  business  trips.  While  the  co3t  in  actual  transportation  dollars 
is  high,  the  cost  in  human  terms  is  even  higher.  Booz  Allen  and  Hamilton  did  a  study  of 
business  practices  in  America  and  the  results  indicated  that  there  was  a  one-to-four 
productivity  loss  ratio  related  to  business  travel.  Thi3  means  that  for  every  hour 
spent  in  a  meeting  to  which  one  travels,  on  the  average,  there  were  four  hours  spent  in 
just  the  logistics  in  getting  there. 

Once  the  decision  to  institute  videoconferencing  was  made,  ARCO  conducted  a  survey 
of  more  than  1000  of  its  managers  and  professionals  asking  them  how  they  perceived  their 
needs  for  videoconferencing.  The  survey  was  conducted  by  the  CCRG  group  from  USC 
Annenberg  School  of  Communications.  They  came  with  a  prepared  questionnaire.  It  sought 
to  profile  the  user  communities,  communications  patterns,  meeting  habits,  preconceptions, 
business  attitudes  by  the  company,  city  and  job  classifications.  As  it  turned  out  it 
captured  our  culture.  Also  as  a  part  of  the  USC  study,  it  was  determined  the  key- 
operation  cities  that  would  be  linked  by  this  system.  From  that  survey  came  the  concept 
of  what  ARCOvision  and  the  conference  room  must  be.  We  knew  at  that  time  that  it  needed 
to  be  a  full-motion,  life  size,  open  mike  system.  It  must  allow  eye  contact  and  have 
several  presentation  capabilities.  The  facility  had  to  have  full-screen  capabilities, 
cameras,  mikes,  chalkboard  capability,  lighting,  graphics  panels,  and  the  ability  to 
project  slides. 

As  far  as  we  know,  ARCOvision,  was  the  first  long-distance  system  tailored  specifically 
to  the  needs  and  culture  of  our  company.  In  addition,  it  was  the  first  nationwide  in-house 
network.  The  rooms  are  equipped  with  the  full  range  of  graphics  materials  typically  used 
in  ARCO  business  meetings  -  slides,  viewgraphs,  and  chalkboards.  Each  identical  ARCOvision 
room  is  comfortable  and  easy  to  use.  Participants  appear  life  size  in  full  color  with 
clear  audio  transmission.  The  computer  controlled  color  cameras  permit  close  examination 
of  charts,  slides,  or  any  other  object  in  the  room  while  still  maintaining  a  presence  of 
the  participants.  The  syst  m  allows  eye  contact  and  has  several  presentation  capabilities. 
For  example,  the  multiple  video  display  options  afford  users  either  a  wide  screen  view  of 
all  participants  in  the  remote  location  or  a  voice-actuated  one  shot  of  just  the  person 
speaking  in  the  other  room.  The  advantage  of  the  wide  screen  view  is  that  participants 
never  lose  contact  with  each  other  and  they  are  able  to  pick  up  visual  cues  through  body 
language  as  they  would  in  an  in-person  meeting.  High  speed  facsimile  hook-ups  between 
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all  ARCOvision  rooms  make  it  easy  to  distribute  charts,  text,  or  graphics  to  the  other 
location.  The  system  has  very  simple  controls.  It  is  very  user  friendly  and  allows  our 
people,  with  a  minimum  amount  of  training,  to  feel  very  comfortable  with  the  operation. 

We  found  that  if  one  keeps  the  system  very  friendly,  and  if  there  is  a  real  dialog  during 
the  videoconference,  that  after  about  five  minutes  people  forget  all  about  the  technology. 
Because  up  to  ten  people  at  each  location  can  participate  in  the  videoconference  the 
entire  project  team  can  attend  the  meeting  increasing  productivity  and  saving  time. 

Our  ARCOvision  rooms  are  used  by  all  levels  of  management  and  require  secure 
transmission  for  highly  sensitive  material.  The  room  is  isolated;  both  the  audio  and 
video  signals  are  scrambled  before  they  leave  the  room  and  again  when  they  reach  the 
satellite  earth  station.  The  word  "codec"  can  be  broken  down  into  its  two  component  words, 
code  and  decode.  The  coding  or  compression  takes  place  according  to  the  algorithm  chosen 
requiring  less  bandwidth  for  cost  efficiency.  An  added  feature  is  the  ability  to  enter 
a  specially  chosen  alpha/numeric  code,  an  encryption  code,  at  the  sending  and  receiving 
location  only.  A  particularly  cautious  user  may  even  set  his  own  encryption  code  moments 
before  the  meeting  starts  to  be  certain  his  communication  cannot  be  intercepted.  The 
signal  is  encrypted  and  scrambled  before  it  leaves  the  conference  location  on  the  way  to 
the  earth  station.  Once  the  signal  reaches  the  earth  station  it  is  scrambled  again  before 
being  transmitted  to  the  satellite.  We  believe  our  system  is  very  secure  and  it  is  being 
used  for  high  level  negotiations. 

Our  system  is  four  years  old;  it's  expanded  to  include  23  U.S.  cities  plus  Canada 
and  Europe.  We  arrived  at  this  expansion  not  by  building  additional  rooms  for  our  private 
use,  but  by  negotiating  with  AT&T  and  the  Meeting  Channel.  There  are  three  ingredients 
required  for  videoconferencing  -  some  way  of  presenting  the  signal,  a  Codec  to  compress 
the  signal  and  a  means  to  transmit  the  signal.  Compression  of  the  signal  requires  the 
use  of  less  bandwidth  making  the  concept  cost  effective.  There  are  a  variety  of  compression 
equipment;  codecs,  available.  Three  of  the  codecs  that  ARCO  uses  are  NEC  (Nippon  Electric 
Company),  GEC  Video  Systems,  and  CLI  (Compression  Labs  Incorporated).  ARCO  chose  to  use 
the  Nippon  Electric  Company  (NEC)  codec  and  all  of  the  ARCOvision  rooms  are  equipped  with 
NEC  codecs.  The  problem  encountered  is  that  the  different  codecs  manufactured  are  not 
able  to  communicate  with  one  another.  Therefore  unless  we  use  the  same  codec  our  video- 
conference  centers  could  not  ccmmur.icata  with  yours.  At  ARCO  we  have  overcome  this 
communication  problem  two  different  ways.  An  additional  codec,  a  GEC  codec,  was  purchased 
and  is  installed  in  the  Philadelphia  room.  This  makes  the  Philadelphia  location  a  gateway 
for  any  additional  services,  usually  international,  that  use  a  GEC  codec.  For  meetings 
originating  in  Philadelphia  the  signal  is  routed  to  the  GEC  codec.  For  meetings 
originating  in  one  of  the  other  ARCOvision  locations  the  signal  is  routed  to  the 
Philadelphia  room  and  then  enters  the  GEC  codec  to  the  international  destination. 

(The  Philadelphia  room  also  has  access  to  AT&T  Accunet  Reserve  system  that  serves  as  the 
transmission  medium  for  international  meetings.) 

U.S.  Sprint  and  their  conference  centers  called  "The  Meeting  Channel"  use  the  CLI 
codec  and  in  this  case  the  Los  Angeles  room  acts  as  a  gateway.  Differently  in  this  case, 
the  signal  is  transmitted  by  wire  from  the  ARCOvision  room  on  the  37th  floor  to  the 
Meeting  Channel  room  in  the  lower  concourse  of  the  same  building.  There  it  is  put  into 
the  CLI  codec  and  transmitted  to  the  chosen  location  on  the  Meeting  Channel  network. 

The  Meeting  Channel  provides  connections  to  Japan  and  Canada. 

The  conference  locations  listed  above  cover  many  time  zones.  A  system  had  to  be 
implemented  so  that  the  requested  time  would  translate  to  the  proper  local  time.  This 
was  overcome  with  the  use  of  Greenwich  Mean  Time  (GMT) ,  the  prime  basis  of  standard  time 
throughout  the  world.  The  use  of  Greenwich  Mean  Time  enables  us  to  schedule  conferences 
and  it  translates  the  time  to  the  correct  local  time  no  matter  what  time  zone. 

Training  is  also  key.  When  people  are  interested  in  using  the  system,  our  conference 
coordinator  will  provide  them  with  an  opportunity  to  experiment,  learn  the  controls,  and 
actually  talk  to  another  city  just  to  get  a  feel  of  how  the  system  works.  Each  ARCOvision 
room  is  run  by  a  full-time  coordinator,  whose  duties  include  scheduling  the  facilities, 
customer  training,  administrative  support,  equipment  maintenance,  first-level  technical 
troubleshooting,  and  internal  marketing.  We  estimate  that  these  employees  spent  60%  of 
their  time  on  daily  operations;  20%  on  maintenance;  10%  on  marketing  and  10%  on 
administrative  duties. 

In  addition,  since  this  is  an  in-house  system,  we  have  a  Network  Operations  Center 
located  in  Los  Angeles,  which  is  staffed  24  hours  a  day  and  which  provides  technical 
services  for  the  satellite  system,  both  to  our  videoconferencing  staff  as  well  as  other 
customers  leasing  time  from  ARCO  for  their  voice  and  data  transmissions. 

As  for  ARCOvision,  we  spent  some  time  evaluating  the  cost/benefit  trade-offs  of 
videoconferencing  and  found  that  we  had  a  very  favorable  return  if  only  15%  of  the  travel 
were  reduced  between  cities  served  by  the  new  system.  In  1980,  the  baseline  year  for  our 
study,  we  spent  more  than  60  million  dollars  on  52,000  trips  to  various  company  locations. 
ARCOvision  was  planned  to  serve  the  cities  which  accounted  for  more  than  half  that  expense. 
So  you  see,  relative  to  the  "enormous  costs"  of  travel,  ARCOvision  is  quite  economical. 

It  should  be  noted  that  we  never  intended  the  system  to  replace  all  business  travel. 

In  general,  we  expected  it  to  supplement,  rather  than  supplant,  necessary  business  trips. 
However,  we  also  believe  that  there  are  many  meetings  which  could  be  conducted  as 
effectively  using  this  medium  as  using  the  old  -  an  airplane  trip  -  and  have  found  that 
to  be  the  case. 
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Some  of  the  intangible  benefits  of  ARCOvision  is  better  utilization  of  manpower  and 
quicker  decision  making.  The  ability  to  bring  more  people  easily  together  to  be 
involved  in  the  decision  making  and  information  sharing  process  builds  better  morale 
and  team  spirit. 

In  1986,  we  saved  something  approaching  2500  work  days.  This  translates  into  a 
man-day  savings  of  about  $700,000.  In  the  same  year  ARCOvision  experienced  790  system 
hours  of  use  and  a  total  travel  savings  of  approximately  $1,300,000.  The  combined  savings 
through  the  use  of  ARCOvision  is  approximately  $2  million.  This  benefit,  while  dramatic, 
really  does  not  capture  all  of  the  true  benefits  of  a  system  like  this.  We  charge, 
internally  $480.00  an  hour  for  the  usage  of  the  system.  When  we  use  it  properly  we 
estimate  we  are  saving  two  thousand  dollars  an  hour. 

The  cost  of  the  network  is  justified  on  a  cost-avoidance  basis  -  we  save  about  $8 
million  a  year  by  having  our  own  communications  facilities.  In  addition,  we  have  the 
advantages  of  security  and  control.  The  first  year  of  existence  there  were  no  charge- 
backs.  The  second  year  we  started  charging  $350  per  hour  and  the  third  year  $480  per  hour. 
This  does  not  recoup  our  operational  costs  for  the  system,  which  are  at  about  $650  per 
hour;  however  it  does  increase  cost  awareness  by  our  employees. 

In  addition  to  our  internal  costs,  there  are  those  associated  with  our  interconnect 
to  international  locations  and  the  Meeting  Channel.  Our  corporation  has  negotiated  reduced 
rates  with  these  organizations.  Costs  are  being  figured  on  an  as-needed  basis;  as 
both  of  these  organizations  are  in  the  process  of  reducing  their  rates.  However,  as  an 
example,  if  we  did  a  videoconference  from  Houston  to  Paris  the  cost  would  be  in  the 
neighborhood  of  $2500  per  hour.  An  Anchorage  to  San  Francisco  would  be  about  $700  per  hour. 

We  are  proud  of  our  videoconferencing  system.  We  really  believe  it  provides  benefits 
in  terms  of  productivity,  in  terms  of  improved  participation,  in  terms  of  shortening  the 
decision-making  cycle.  And  it  saves  dollars.  All  of  the  things  we  set  out  to  do,  we  feel 
we  have  accomplished.  We  believe  that  the  original  vision  provided  by  Mr.  R.  0.  Anderson 
has  been  taken  into  account.  As  a  company,  we're  going  the  distance  in  providing 
tomorrow's  decisions  today.  Videoconferencing  helps  accomplish  that. 
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xAbstract 

Versts  can  create  or  resolve  barriers  in  the  processes  pertaining  to  the  transfer  of 
knowledge  and  the  access  to  information.  Selecting  words  to  organize,  process  and 
retrieve  data  leads  to  the  establishment  of  hierarchies.  The  building  of  thesauri 
provides  for  tools  enabling  the  organization  of  concepts.  National  and  international 
guidelines  have  been  developed  for  the  establishment  of  monolingual  and  multilingual 
thesauri . 

New  words  find  their  way  in  solid  hierarchies.  The  introduction  mecanisms  for  new 
words  rely  on  cross-referencing  to  already  established  and  recognized  terms.  Manual 
and  machine  indexing  provide  the  same  basic  tools,  machine  Indexing  being,  in  many 
cases,  more  feasible. 

Standardized  terms  are  compiled  in  special  listings  such  as  the  glossaries  prepared 
and  published  by  NATO.  The  linguistic  quality  norms  of  standardized  terminologies 
are  limited  to  the  specialized  objectives  of  such  terminologies. 

Terminological  barriers  are  costly  and  should  therefore  be  reduced  in  order  not  to 
hinder  the  transfer  of  knowledge  and  access  to  information  data. 


Words:  Keys  or  Barriers  to  Information  Transfer 


History  shows  that  man  “r"erienced  information  transfer  barriers  as  early  as  he  began 
organizing  knowledge.  The  arrangement  of  the  first  encyclopaedias  produced  in  the 
17th  century,  in  France  and  in  England,  illustrate  this  fact.  Larousse  inclined  towards 
the  alphabetical  arrangement  gave  in,  in  fact,  to  his  predecessors  who  aimed  at  classi¬ 
fying  knowledge  and  the  concepts  and  words  relating  to  the  realities  of  their  time 
according  to  broad  categories.  The  popularity  of  the  alphabetical  dictionary  and 
encyclopedia  favored  the  recognition  of  new  words;  updating  a  reference  tool  such  as 
an  alphabetical  dictionary  became  a  much  easier  task  and  proved  also  to  be  a  good 
investment.  Quantity  and  quality  were  not  necessarily  Interlocked  criteria. 

The  word  "thesaurus"  was  first  popularized  by  Peter  Mark  Roget  with  the  publication, 
in  1852,  of  his  Thesaurus  of  English  Words  and  Phrases.  Organized  in  terms  of  words 
related  to  concepts,  his  thesaurus  pointed  the  way  for  all  modern  thesauri.  The  adap¬ 
tation  of  a  thesaurus  to  an  information  retrieval  system  was  first  referred  to  in  a„ 
IBM  Journal  in  1957  whereas  the  first  thesaurus  actually  created  for  controlling  the 
vocabulary  of  an  information  retrieval  system  was  developed  around  1959  by  the  DuPont 
Company . 

In  1962,  the  Educational  Resources  Information  Center  (ERIC)  developed  an  information 
retrieval  system  requiring  the  building  of  a  huge  thesaurus.  In  1966,  guidelines  and 
conventions  managing  the  selection  of  appropriate  technical  subject  indexing  and  re¬ 
trieval  terminology  as  well  as  the  display  of  this  terminology  in  a  thesaurus  format 
were  outlined  in  the  US  Department  of  Defence  Manual  for  Building  a  Technical  Thesaurus. 
These  conventions  were  developed  by  Project  LEX,  the  DOD  Technical  Thesaurus  Task  Force, 
in  cooperation  with  the  Engineers  Joint  Council.  This  Manual  focuses  on  the  criteria 
for  determining  the  usefulness  of  prospective  thesaurus  terms  or  descriptors,  on  the 
means  of  resolving  ambiguities  among  descriptors  and  on  the  methods  for  systematically 
creating  a  cross  reference  structure  that  would  disolay  hierarchical  and  conceptual 
interrelationships. 

Most  thesaurus  formats  are  developed  according  to  basic  organization  rules  governing 
not  only  the  corpus  but  most  Important  the  access  keys  allowing  for  the  indexing  or 
classification  of  concepts  and  essentially  the  retrieval  of  the  data  or  information 
processed  according  to  the  thesaurus.  Basically,  all  thesauri  should  comprise  an 
introductory  statement  explaining  the  purpose  of  the  thesaurus  and  its  arrangement, 
an  alphabetical  listing  of  all  descriptors  and  cross  references,  a  corpus  of  descriptors 
arranged  by  subject  categories  based  on  a  subject  category  list,  a  graphic  display 
of  hierarchical  relationships  and  finally  a  permuted  display  of  all  descriptors  in 
the  order  of  each  meanlngfull  word.  These  structural  elements  garantee  the  quality 
of  the  thesaurus  system  and  minimize  the  linguistics  and  terminological  barriers  with 
regard  to  information  processing  and  transfer  of  knowledge. 

In  1970,  an  International  Conference  on  the  Ceneral  Principles  of  Thesauri  Building 
was  held  in  Warsaw  under  the  auspices  of  the  Polish  Academy  of  Sciences.  The  aim  of 
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the  conference  was  to  hold  a  working  discussion  leading  to  the  identification  of  pro¬ 
blems  relevant  to  the  building  of  thesauri  in  order  to  develop  a  uniform  comprehension 
of  this  building  process  to  ensure  easier  communication  and  exchange  of  information 
between  information  systems  covering  different  ranges  of  subject  matter,  and  using 
different  languages.  Discussions  centered  on  the  meanings  of  terms  such  as  "thesaurus", 
"descriptor"  and  "ascr iptor " ;  the  participants  also  focused  on  the  fundamental  elements 
of  thesaurus  arrangement  and  special  attention  was  given  to  the  methodology  of  thesaurus 
building,  the  selection  and  quality  of  descriptors  and  thus  the  interrelations.  This 
international  gathering  of  experts  in  thesaurus  building  also  assessed  the  problems 
relevant  to  the  building  of  monolingual  thesauri  and  polylingual  thesauri. 

Unesco's  Guidelines  for  the  Establishment  and  Development  of  Monolingual  Scientific 
and  Technical  Thesauri  for  Information  Retrieval,  published  in  1970  emphasizes  the 
needs  for  standards  in  this  activity. 

Unesco  also  published,  in  collaboration  with  ISO,  the  Guidelines  for  the  Establishment 
and  Development  of  Multilingual  Thesauri  in  1976  and  revised  in  1980.  This  standard 
reviews  the  new  kinds  of  problem  posed  by  multilingualism  most  of  which  are  concerned 
with  the  establishment  of  equivalences  between  terms  in  different  languages  and  allows 
for  a  range  of  options  for  problem  solving  based  on  the  fact  that  a  standard  for  multi¬ 
lingual  thesauri  can  never  be  prospective. 

ASTUTE  (Automated  System  for  Thesaurus  Updating,  Testing  and  Editing)  was  developed, 
10  years  ago,  to  produce  multilingual  thesauri  for  EURONET.  Those  already  published 
thesauri  have  proven  most  efficient  in  machine  translation. 

The  evolution  of  thesaurus  building  comprises  the  use  of  technological  tools.  Thesaurus 
construction  relies  on  mini-computer  applications  in  many  cases.  The  mini-computer 
is  used  to  collect,  edit  and  correct  candidate  thesaurus  terms  and  therefore  eases 
the  process  of  grouping  terms  into  files  of  similar  concepts  and  facilitates  the  gene¬ 
ration  of  products  useful  in  vocabulary  review  and  in  term  structuring.  Because 
thesaurus  terms  are  already  in  machine-readable  form,  it  is  simple  to  prepare  print 
programmes  to  provide  permuted,  alphabetic,  hierarchical  and  chart  formatted  term 
displays.  The  use  of  the  mini-computer  facilitates  initial  thesaurus  entry  development 
by  reducing  clerical  effort,  editorial  staff  aecisions  and  overall  processing  time. 

Classification  schemes,  subject  headings  lists  and  thesauri  are  tools  developed  to 
control  vocabularies  of  informaMon  storage  and  retrieval  systems.  There  are  many 
basic  features  that  are  common  to  both  classification  schemes  and  thesauri.  The  major 
difference  is  that  in  thesauri,  words  denote  both  concepts  and  ranking  whereas  in 
classification  schemes,  notation  denotes  ranking. 

The  selection  of  descriptors  for  both  thesauri  and  classification  schemes  building 
has  arouse  many  theories  among  librarians,  information  specialists  and  terminologists . 
There  are  basically  two  different  strategies  for  identifying  candidate  terms.  A  con¬ 
sensus  of  experts  of  a  field  (the  committee  approach)  is  different  from  the  knowledge 
expressed  in  the  literature  in  this  field  (emperical  approach).  A  number  of  research¬ 
ers  have  studied  the  validity  of  this  assumption  only  to  conclude  that  the  two  stra¬ 
tegies  failed  to  generate  significantly  different  lists  of  terms  and  thus  raising 
Important  questions  for  those  involved  In  establishing  guidelines  for  thesaurus  design 
and  foremost  for  those  implementing  the  guidelines. 

In  automatic  indexing,  the  frequency  characteristics  of  terms  in  the  documents  of 
a  collection  have  been  used  as  indicators  of  term  importance  for  content  analysis 
and  indexing  purposes.  In  particular  very  rare  or  very  frequent  terms  are  normally 
believed  to  be  less  effective  than  medium-frequency  terms.  More  recent  automatic 
Indexing  theories  have  been  devised  to  use  not  only  the  term  frequency  characteristics 
but  also  the  relevance  properties  of  the  terms.  The  term  precision  and  term  utility 
weights  that  are  based  on  the  occurrence  characteristics  of  the  terms  relevant  (as 
opposed  to  the  nonrelevant)  documents  of  a  collection  are  introduced  and  methods  are 
suggested  for  estimating  the  relevance  properties  of  the  terms  based  on  their  overall 
occurrence  characteristics  in  the  collection.  An  evaluation  assessing  the  results 
of  the  weighting  systems  using  the  term  relevance  properties  with  the  more  conventional 
frequency-based  methodology  determines  the  measurement  of  term  importance  in  automatic 
indexing  systems. 

At  present,  three  kinds  of  automatic  indexing  systems  are  available:  (1)  inverted 
file  full  text  indexing,  (2)  dictionary-based  indexing  and  (3)  dictionary  and  rule- 
based  indexing.  The  only  method  which  is  fully  automatic  is  still  the  full  text 
indexing . 

In  J  982 ,  the  British  Library  conducted  a  study  of  free  language  vocabulary  analysis 
especially  oriented  towards  the  automatic  generation  of  indexing  elements  for  document 
and  query  representation  and  their  effectiveness  in  mechanized  information  retrieval. 
Automatically  derived  sets  of  index  terms  were  evaluated  for  their  performance  in 
computerized  retrieval  queries  and  conclusions  confirm  that  performance  of  searches 
on  the  automatically  generated  free  index  terms  is  comparable  to  searches  on  manually- 
assigned  free  index  terms  in  the  low  recall-high  precision  range.  High  recall  was 
possible  with  the  automatically  assigned  subject  headings  and  classification  but  with 
poor  precision.  Cost  studies  showed  that  the  automatic  system  for  generating  indexing 
terms  was  less  costly  than  the  manual  system. 
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In  selecting  terms,  a  review  of  all  candidate  descriptors  must  be  conducted  to  avoid 
proliferation,  maintain  consistency  and  clarify  any  potential  ambiguities  with  already 
established  and  accepted  terms.  It  is  therefore  not  a  simple  exercise  to  justify 
new  terms. 

At  this  stage,  a  distinction  should  be  made  between  "terms"  and  "identifiers"  or 
"names"  such  as  specific  projects  (Brave  Lion,  "Big  Lift"  Operation),  geographic  loca¬ 
tions  (Camp  David,  SACLANT  (Supreme  Allied  Commander  Atlantic),  Anti-Submarine  Warfare 
(ASW)  Research  Centre),  persons  (Secretary  General  NATO  Supreme  Allied  Commander), 
trade  names  (SIDEWINDER  (nuclear  missile)),  programmes  (NATO  Sea  Sparrow,  NATO  Patriot 
Programme),  legislation  (Brussels  Treaty,  Vandenberg  Resolution),  organizations  (Al¬ 
lied  Communication  Security  Agency,  Central  Europe  Pipeline  Office),  equipment  (air- 
delivered  weapons,  F-I6  air  combat  fighter),  acronyms  (NATO,  AGARD). 

New  terms  should  be  added  to  solid  hierarchies,  logical  synonyms  and  contemporary 
terminology.  Terminological  barriers  are  avoided,  possibly  not  solved,  by  cross-refer¬ 
encing  and  linking  new  terms  to  established  and  recognized  hierarchies.  A  continuous 
updating  and  sorting  process  adds  to  the  feasibility  and  efficiency  of  this  approach, 
especially  as  far  as  managing  a  thesaurus  and  a  terminology  database.  The  late  intro¬ 
duction  of  new  terms  tends  to  favour  the  creation  of  parallel  and  ambiguous  termino¬ 
logies.  This  again  can  be  avoided  by  cross  referencing  and  linking  new  terms  to 
existing  hierarchies. 

A  new  term  should  always  be  fully  documented  according  to  reliable  documentary  and 
bibliographic  sources  in  order  to  avoid  ambiguities.  The  term  should  also  be  analyzed 
in  its  proper  context  in  order  to  link  it  to  the  category  of  hierarchy  to  which  it 
belongs.  At  this  point,  cross-referencing  should  be  used  to  clarify  interrelations, 
inclusions  and  exclusions. 

In  military  terminology,  "a  service  truck"  is  just  a  truck  whereas  from  a  civilian 
point  of  view  "a  service  truck"  is  a  truck  equipped  for  emergencies  and  routine  ser¬ 
vicing  used,  for  example,  by  telephone  and  hydro  companies  and  others. 

F.opcr  HK’nagoment  of  sp»cialized  terminologies  through  glossaries,  vocabularies  and 
other  lists  of  terms,  maintained  by  either  manual  or  computerized  means,  offer  an 
economical  and  feasible  approach  to  the  reduction  of  terminological  barriers.  One 
of  the  better  known  examples  is  without  any  doubt  the  Dictionary  of  Military  and 
Associated  Terms  published  by  the  Joint  Chiefs  of  Staff,  in  1979,  in  Washington. 

DOD  dictionaries  and  glossaries  (as  well  as  NATO's)  are  published  for  mandatory  use 
by  DOD  and  NATO  management.  Terms  which  are  identified  as  DOD,  for  example,  are  not 
otherwise  defined,  unless  a  different  application  is  Intended.  Definitions  which 
identify  a  different  application  are  to  be  submitted  to  a  terminology  committee  for 
consideration  and  approval.  In  the  Department  of  Defense  Dictionary  of  Military  and 
Associated  Terms,  terms  and  definitions  identified  with  "NATO’1  and  "CENTO" ,  represent 
the  US  official  position  in  international  standardization  agreements  and  in  order 
to  ensure  the  Integrity  of  these  commitments,  terms  so  designated  are  to  be  used  by 
US  members  of  working  parties,  groups  or  panels  of  experts  in  the  processing  of  all 
standardization  agreements.  When  an  agreed  international  term  does  not  exist,  the 
DOD  term  will  be  used  in  establishing  the  United  States  position. 

The  standardization  mandate  regulates  also  the  production  of  separate  dictionaries 
and  glossaries  containing  terms  and  definitions  for  joint  usage  which  cannot  be  pub¬ 
lished  within  DOD  without  prior  coordination  and  approval  by  the  authorized  officers. 

An  extremely  Important  note  comments  on  the  editorial  objectives  of  the  DOD  dictionary 
by  simply  indicating  that  "...  the  DOD  dictionary  is  not  to  be  used  as  an  editorial 
and  style  guide...". 

Standardized  terminologies  follow  the  same  type  of  processing.  It  is  important  to 
underline,  at  this  point,  that  standardized  terminologies  are  not  necessarily  of  high 
linguistic  quality.  The  objectives  set  by  term  banks  of  standardized  terms  allow  for 
limited  linguistic  quality  unless  the  standards  from  which  the  terms  are  extracted 
were  prepared  and  written  by  teams  of  experts,  cautious  of  both  technological  and 
linguistic  norms . 

Specialists  and  experts  create  new  terms  reflecting  the  tdvancement  of  their  discipline 
and  field  of  research.  These  new  words  make  their  ways  in  papers  presented  at  scien¬ 
tific  and  technical  conferences  and  in  publications.  Once  created  and  fed  into  a 
network,  regional,  national  or  International,  those  new  words  are  picked  up,  used 
and  disseminated. 

Whether  the  new  words  are  linguistically  good  according  to  established  linguistic 
quality  criteria,  they  are  used  and  therefore  exist.  In  recording  new  terms,  one 
has  to  establish  a  profile:  history  and  genesis  of  the  new  word,  aeuninuu  and 
context,  sources  quoting  the  new  words,  interrelations  to  existing  terms,  comparison 
of  definitions,  contexts,  synonyms,  etc.  The  terminological  record  has  to  be  docu¬ 
mented  and  exhaustive. 


(Examples  extracted  from  The  North  Atlantic  Treaty  Organizations:  Facts  and  Figures. 
Brussels,  NATO  Information  Service,  1981). 
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Terminological  barriers  will  always  hinder  the  transfer  of  knowledge  and  the  access 
to  Information.  Terminological  barriers  are  costly;  their  Impact  on  communication 
systems  of  various  nature  has  to  be  taken  Into  account.  The  Tower  of  Babel  could 
have  easily  been  the  first  term  bank. 
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LA  BARRIERE  LINGUISTIQUE  :  PROBLEMES  DE  TRADUCTION 


I.  CLEMENS,  Ir.  civil  ^lectr. ,  Chef  du  Centre  Beige  de  Traduction, 
CNDST-CBT ,  4  Bid  de  1 ’Empereur  B-1000  Bruxelles,  Belgique. 


RESUME 

Les  probiemes  1 inguist iques  sont  ^troi tement  1  i£s  aux  probl^mes  d’in- 
formations.  Ils  sont  done  examines  con jo i ntement  afin  de  r^soudre  les 
probl^mes  terminologiques  et  d’acc^der  £  la  masse  documenta i r e .  On 
examine  les  probiemes  de  comprehension  des  textes  par  le  special iste 
et  les  difficultes  rencontrees  par  le  traducteur.  Les  probiemes 
1 i ngu i st i ques  sont  examines  en  fonction  des  par t i cu 1 ar i tes  des  pays 
unilingues  et  multilingues  et  en  fonction  des  groupes  de  langues.  Les 
moyens  existants  pour  reduire  la  barriere  linguistique  et  les  insti¬ 
tutions  impliquees  sont  considers  du  point  de  vue  de  1  ’  ut  i  1  i  sateur 
et  du  traducteur.  Le  traducteur  est  le  per sonnage-c 1 e  pour  1 ’aboli¬ 
tion  de  la  barriere  linguistique.  II  est  confronts  aux  probiemes  de 
comprehension  de  textes  sc i ent i f iques  et  techniques,  aux  probiemes  de 
recherche  de  sources  d’ information,  de  termes  techniques  ainsi  qu’aux 
probiemes  d’adaptation  aux  methodes  modernes  de  travail.  L’emploi  de 
1 ’ informat ique  lui  procure  une  aide  certaine  et  de  multiples  possibi- 
1 ites  de  rationalisation  du  processus  de  traduction.  Parmis  les  modes 
de  traductions  connus,  la  traduction  assistee  par  ordinateur  paratt 
actuellement  la  plus  performante  :  utilisation  des  PC  pour  le  proces¬ 
sus  de  traduction  proprement  dite,  emploi  de  d i c t i onna i r es  informat i- 
ses  et  de  bases  de  donn^es  b ib 1 iograph iques  en  tant  que  sources 
terminologiques  multilingues  fiables  et  actual is^es.  L’ensemble  des 
mesures  £vaqu£es  permet  de  reduire  les  difficultes  1 i ngu i st i ques  du 
cansommateur  de  1 ’ information  et  d’assurer  une  aide  efficace  au 
traducteur . 


I  *  La  barri|re  IiQgyi.lt  igue  l^ientif  igue  et 

£.§g.tiDisy?  ngderne 

1.  Obtention  des  documents  en  texte  integral  et  extraction 
de  1 ’ information 

a)  d i sponib i 1 i t£  des  documents 

b)  contacts  avec  les  d£tenteurs  d ' informat  ion 

2.  Probiemes  1  i  ngu i s t i ques 

a)  probiemes  1 i ngu  i  s t i ques  dans  les  pays  unilingues  et 
mu  1 1 1 1 i ngues 

b)  probiemes  de  langues  d  iff ic  i  lement  accessibles 
I  I  .  Moy§D§  de  reduction  de  .la  barr  idre  1 

1.  Cover-to-cover  translations 

2.  Utilisation  des  traductions  existantes 

a)  collecte  des  traductions 

b)  creation  de  BD  pour  les  traductions  existantes 

c)  probiemes  de  d£pistage  et  de  rdcupdration  des  tra¬ 
ductions  existantes 

3.  Traduction 

a)  traductions  tr ad i t ionne 1 les 

b)  traduction  de  texte  integral  par  ordinateur 

c)  traduction  assistee  par  ordinateur 


* 


III.  Institutions  e  t  traducteurs  pour  aider  £  r^duj.  r  e  J.a 
barr^re  ilD9yl § t igue 

1.  Centres  d  ’  i nf or mat i on  et  de  documentation 

a)  r61e  de  dispatcher 

b)  cooperation  A  la  production  de  BD 

c)  aide  terminologique 

2.  Le  traducteur  et  les  probl£mes  de  traduction 

a)  comprehension  de  textes  de  haute  technicite 

b)  probiemes  term i no  1 og i ques 

c)  d ic t ionna i res  et  glossaires 

IV.  Le  traducteur  et  ia  traduction  assistee  gar  ordinateur 

1.  Modernisation  des  mbthodes  actuel les  de  traduction 

a)  traitement  de  textes 

b)  imprimantes  A  alphabets  multiples 

c)  logiciels  d’aide  A  la  traduction 

2.  BD  ter m i no  1 og i ques 

a)  Eurodicautom 

b )  Ter m i urn 

c)  glossaires  et  d i c t i onna i r es  automatiques 

3.  Utilisation  des  BD  b i b 1 i og r aph i ques  er,  tant  que  source 
de  terminologie  actual isee 

a)  BD  mu  1 1 i 1 i ngues ;  utilisation  directe  des  thesaurus 
bi  BD  b  ib  1  iograph iques  avec  selection  par  sujet  et  par 
1 angue 

v.  Qonciusions 

1.  Solution  des  probl&mes  1 i ngu i st i ques  du  consommateur 
d ’ i nf or mat  ion 

2.  Information  et  aide  au  traducteur 
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I  ■  LSQQuage  barriers  and  modern  scientific  and  techrucai 
IclO-Cma  t  ion 

1.  Problems  to  provide  full-text  documents  and  to  get 
information 

a)  acces  to  documents 

b)  communicat ion  with  owners 

2.  Problems  of  understanding  litterature 

a)  problems  of  language  understanding  in  unilingual  and 
mu  1 1 i 1 1 ingual  countries 

b)  problems  with  difficult  accessible  languages 

I  I .  Means  for  language  barrier  reduction 

1.  Cover -to-cover  translations 

2.  Use  of  existing  separate  translations 

a)  collection  of  existing  translations 

b)  DB  production  for  existing  translations 

c)  problems  to  find  and  to  get  translations 

3.  Translation 

a)  human  translation 

b)  machine  translation 

c)  computer-assisted  translation 

III.  Organisations  and  translators  providing  help  to  reduce 
language  barriers 

1.  Information  and  documentation  centres 

a)  dispatching  targets 

b)  cooperation  for  translation  DB  production 

c)  terminological  help 

2.  Translator  problems 

a)  compr ehent ion  of  scientific  and  technical  texts 

b)  terminological  problems 

c)  d i c t i onnar i es  and  glossaries 

IV.  lE§D5iatgr  and  corngu t er ^assf s t ed  translation 
1.  Modernisation  of  translation  processes 

a)  word  processors 

b)  multiple-fonts  printers 

c)  softwares  for  translation  help 
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S.  Terminological  DB 

a)  Eurodicautom 

b)  Termium 

c)  computerised  d ic t ionnar ies  and  glossaries 

3.  Use  of  bibliographical  DB  as  up-to-date  terminological 
sources 

a)  multilingual  DB,  direct  use  of  thesauries 

b)  use  of  bibliographical  DB  with  selection  by  subject 
and  by  language 


V.  Conclusions 

1.  Solution  of  information  users  linguistic  problems 

2.  Information  and  help  to  translators 
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Dans  la  literature  scientifique  et  technique,  surtout  dans  les 
domaines  de  haute  technologie,  la  terminologie  est  mouvante.  La 
recherche,  le  secteur  industriel  et  le  secteur  commercial  sont  con- 
front^s  A  de  nouvelles  dAcouvertes,  de  nouveaux  produits,  done  A  de 
nouvelles  appellations.  Le  manque  de  standardisation  ter m i no  1 og i que 
et  1 ’ut i 1 isation  de  jargons  industriels  rendent  la  comprehension 
encore  plus  difficile  non  seulement  lorsqu’il  s’agit  de  langues 
AtrangAres,  mais  aussi  pour  les  usagers  pratiquant  la  langue  du 
document . 


La  science  de  1 ’ inf or mat  ion »  crAAe  au  profit  des  spAc i a  1 i stes ,  tend  A 
couvrir  exhaus t i vement  la  littArature  scientifique  et  technique  dis- 
ponible  dans  le  monde.  La  production  de  bases  et  banques  de  donnAes 
i  nf  o  rma  t  i  sAes  est  devenue  pratique  courante  et  err  fait  la  seule 
solution  d ’ informat  ion  face  A  la  masse  croissante  de  documents. 

□n  distingue  deux  genre  de  bases  et  banques  de  donnAes  :  cel les  qui 
sont  d’utilitA  universelle  (CHEMABS,  INSPEC,  METADEX,  NTIS,  etc.)  et 
cel les  qui  sont  AlaborAes  pour  un  usage  plus  restraint,  pour  une 
industrie  spAcifique,  quelques  entreprises  ou  mAme  une  seule  institu¬ 
tion.  Mais  quelles  qu’elles  soient,  leurs  producteurs  et  utilisateurs 
se  heurtent  au  problAmes  de  langue  et  de  terminologie. 


La  standardisation  ter m i no  1 og i que  elle-mAme,  aussi  bien  que  la  pro¬ 
duction  des  bases  de  donnAes,  est  Atroitement  liAe  A  la  communication 
entre  les  producteurs  de  documents,  A  la  d i spon i b i 1 i t A  de  ces  docu¬ 
ments,  au  signalement  de  ceux-ci  et  A  1 ’organisation  d’une  informa¬ 
tion  cohArente. 

La  d i sponib i 1 i t A  des  documents  est  pr ob 1 Ama t i que  pour  des  raisons 
bien  connues  : 


-  concurence  i ndustr i e 1 1 e ,  sAcuritA,  raison  d’Etat; 

-  difficultA  de  dApistage  des  producteurs  et  dAtenteurs  de  documents 
et  de  contact  avec  ceux-ci. 


La  compilation  d’une  base  de  donnAes  constitue  un  premier  pas  vers 
une  communication  meilleure  et  vers  une  standardisation  terminologi- 
que;  en  effet,  le  producteur  de  la  base  de  donnAes  contacte  les 
auteurs  des  documents,  rassemble  ces  documents  ou  leur  signalement, 
les  indexe  d’une  fa<?on  uniforme,  crAant  ainsi  une  terminologie  plus 
ou  moins  homogAne.  En  retour,  les  utilisateurs  des  bases  de  donnAes 
se  f am i 1 i ar i sent  avec  une  seule  terminologie  et  ont  tendance  A  l’uti- 
1 i ser  . 


L ’ ut i 1 i sat i on  universelle  des  bases  de  donnAes  par  des  usagers  de 
diffArents  pays  et  de  langues  diffArentes  les  habitue  tous  A  une 
seule  langue  et  une  seule  terminologie  < en  majoritA  1 ’anglais)  et 
ouvre  dAjA  une  brAche  dans  le  mur  de  1 ’ i ncompr Ahension  1 i ngu i st i que . 


II  convient  ici  de  souligner  que  les  problAmes  1 i ngu i s t i ques  sont 
diffArents  d’un  pays  A  1 ’autre  :  il  y  a  des  pays  uni  1 ingues  et 
mu  1 1 i 1 i ngues . 

Les  langues  aussi  peuvent  Atre  divisAes  en  deux  catAgories  :  les 
langues  dites  faciies  :  langues  latines  et  germaniques,  et  les  lan¬ 
gues  que  1 ’on  qualifie  de  d  i  ff  i  c  i  lenient  accessibles  :  langues  slaves, 
chinois,  japonais,  arabe,  etc.);  de  ces  derniAres  seules  seront 
retenues  cel les  qui  vAhiculent  actuel lement  une  production  scientifi¬ 
que  significative,  comme  par  exemple  le  japonais  et  le  russe. 
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Lorsque  1 ’on  dif^rencie  les  probl^mes  par  pays  on  remarque  que  les 
pays  uni  1  inques  tels  que  la  France,  la  Grande  Bretagne,  les  (JSA  ont 
des  probiemes  avec  toutes  les  langues;  mais  s’ils  ont  des  difficulty 
de  comprehension,  en  revanche  i Is  ont  depuis  longtemps  mis  en  place 
des  structures  et  cree  des  organismes  facilitant  la  traduction  pour 
les  langues  des  deux  groupes.  Les  chiffres  sont  s  i  gni  f  i  ca  t  i  f  s  :  S9*/. 

de  toutes  les  traductions  r^alis^es  concernent  les  paires  de  langues 
f r anPai s-ang 1  a i s  et  a  1 1 emand-ang 1  a i s . 

Les  pays  multilingues  tels  que  la  Suisse,  le  Canada,  la  Belgique  ont 
beaucoup  mains  de  probiemes  de  comprehension  des  textes  originaux  et 
ont  de  ce  fait  un  r£seau  de  traduction  scientifique  plus  faible,  axe 
surtout  sur  les  langues  d i f f i c i 1 ement  accessibles. 

Reduire  la  barri^re  linguistique  ne  signifie  pas  traduire  systemati- 
quement  les  textes  dans  la  langue  de  1 ’ ut i 1 i sateur  du  moment.  Lorsque 
l’on  parle  de  reseau  de  traduction  et  de  structure  facilitant  la 
traduction,  peu  nombreux  sont  ceux  qui  en  real isen1  l’ampleur  et 
la  diversite.  II  est  pourtant  primordial  que  les  utilisateurs  d ’ in¬ 
formation  et  les  specialistes  de  1 ’information  les  connaissent. 

II  y  a  en  premier  lieu  les  traductions  integrales  de  documents  (co- 
ver-to-cover  translations)  que  l’on  publie  surtout  pour  les  documents 
de  langues  d i ff i c i 1 ement  accessibles  (russe,  japonais).  II  existe 
aussi  des  ouvrages  de  reference  qui  les  signalent. 

II  y  a  egalement  un  nombre  considerable  de  textes  qui  sont  dtjA 
traduits  et  peuvent  etre  reutilises  u 1  ter i eur ement  plusieurs  fois. 
Pour  cela  il  faut  les  depister,  les  collecter  et  les  compiler  sous 
forme  d ’ouvrages  de  references  ou  de  banques  de  donn^es  pour  les 
signaler  aux  usagers  potentiels.  La  collecte  de  documents  traduits  se 
heurte  aux  difficultes  dej£  evoquees  de  concurrence,  de  securite. 

C’est  dans  ces  cas  particuliers  que  doivent  intervenir  des  groupe- 
ments  plus  restraints  garantissant  un  contrflle  des  utilisateurs,  une 
defense  des  inter@ts  de  chacun  A  l’interieur  du  groupement  ainsi  que 
le  secret  prof ess ionnel  pour  les  fournisseurs  de  traductions  et  leurs 
utilisateurs.  Ce  n’est  que  dans  ce  contexte,  en  respectant  les  fac- 
teurs  evoques  qu’il  est  possible  de  rassembler  les  traductions  dejA 
r ea 1 i sees . 

II  va  de  so i  que  la  base  de  donnees  est  la  solution  la  plus  valable 
pour  mettre  ces  traductions  A  la  disposition  des  interesses.  Pour 
exemple  il  convient  de  citer  le  "World  Translation  Index",  base  de 
donnees  qui  propose  les  traductions  rassembiees  en  Europe  et  aux  USA 
par  le  National  Translation  Centre  (USA)  et  1 ’ I nter nat i ona 1  Transla¬ 
tion  Centre  (Europe).  Le  WTI  est  Egalement  diffuse  sous  forme  tradi- 
tionnelle  d’ouvrage  de  reference. 

Certains  groupements  signalent  pour  leurs  membres  les  traductions 
effectu£es  au  sein  du  groupe,  par  exemple  le  Centre  de  Recherches 
rteta 1 lur g iques ,  Belgique. 

En  dernier  recours  pour  abolir  la  barri^re  linguistique  il  y  a  evi- 
demment  la  traduction.  On  distinque  : 

-  la  traduction  t r ad i t i onne 1 1 e ; 

-  la  traduction  r£alis£e  par  ordinateur; 

-  la  traduction  assistee  par  ordinateur. 

Le  mode  de  traduction  traditionnel  fait  appel  aux  traducteurs  et 
presente  certaines  difficultes,  surtout  pour  les  textes  techniques 
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tr£s  specialises  :  le  traducteur  se  heurte  A  des  probl^mes  de  compre¬ 
hension  technique  et  de  terminologie. 

La  traduction  par  ordinateur  a  fait  de  grands  proqres,  mais  son 
efficacite  et  la  quality  de  la  traduction  sont  encore  discutables.  De 
multiples  etudes  en  ont  parie,  evaluant  ses  avantages  et  ses  defauts. 
On  estime  toutefois  que  ces  traductions  sont  valables  du  moins  pour 
1 ’ informat  ion  des  sc  i  ent  i  f  i  que.  s  et  e  condition  que  le  vocabulaire 
introduit  dans  l’ordinateur  soit  limite  A  un  domaine  bien  determine. 
II  est  utile  de  souligner  le  resultat  positif  obtenu  pour  1 ’ indexda- 
tion  des  documents  et  la  preparation  des  resumes  A  introduire  dans 
les  bases  de  donnees.  Les  textes  standards  ou  repetitifs  sont  egale- 
ment  assez  bien  traduits.  L’avantage  principal  de  la  traduction  par 
ordinateur  est  i nd i scut ab 1 ement  la  possibilite  de  standardisation  de 
la  term i no  log i e .  Le  phenomene  de  standard isat ion  provoque  par  les 
bases  de  donnees  pour  une  langue  se  manifests  ici  pour  plusieurs 
langues.  Ma 1 heur eusement  les  reproches  essentiels  que  l’on  formule 
generalement  sont  considerables  :  prix  trop  eieve,  perte  de  temps 
considerable  en  revision  des  textes,  inaptitude  de  l’ordinateur  A 
prendre  en  consideration  les  nuances  du  texte. 

La  traduction  assistee  par  ordinateur,  quant  A  elle  plus  flexible, 
combine  le  travail  creatif  du  traducteur  et  les  possibilites  de 
1 ’ i nf ormat i que .  C’est  pour  cela  qu’elle  presente  une  solution  avanta- 
geuse,  valable,  realisable  sans  frais  excessifs.  Ses  avantages  sont 
evidents  pour  1 ’ ut i 1 isateur  des  textes  et  pour  le  traducteur,  le 
rendement  et  la  qualite  de  la  traduction  s ’ en  trouvent  ameiiores. 

On  constate  que  les  possibilites  de  reduction  de  la  barriere  linguis- 
tique  sont  nombreuses;  il  sera  done  utile  de  connattre  les  institu¬ 
tions  qui  les  mettent  en  oeuvre. 

II  y  a  avant  tout  les  organismes  de  traduction  et  de  diffusion  des 
bases  et  banques  de  donnees  : 

-  les  producteurs  de  bases  et  banques  de  donnees; 

-  les  centres  d ’ information; 

-  les  centres  de  signalement  des  traductions; 

-  les  centres  termino 1 og iques . 

Des  organismes  contactent  les  producteurs  de  documents,  compilent  la 
litterature,  mettent  1 ’ information  A  la  disposition  de  1 ’ ut i 1 i sateur , 
rassemblent  les  traductions  existantes,  produisent  et  exploitent  des 
d i c t i onna i r es  ter m i no  1 og iques  i nf or mat i ses .  Ils  sont  utiles  A  l’usa- 
ger  puisqu’iis  r^pondent  de  diverses  fa?ons  A  leurs  besoins  en  infor¬ 
mation  et  abolissent  la  barriere  1 i ngu i s t i que .  Ils  sont  egalement  les 
auxilliaires  du  traducteur  puisqu’iis  sont  impliques  dans  la  produc¬ 
tion  et  1 ’exploitation  des  sources  termi no  1 og i ques  et  des  sources 
d’ information  sur  les  traductions  existantes. 

L ’ u t i 1 i sa teur  d ’ inf ormat ion  confronts  aux  prooiemes  1 i ngu i st i ques  et 
le  traducteur  qui  doit  1 ’aider  ont  tous  deux  le  plus  grand  interet  A 
les  connaftre  et  £  recourir  A  leurs  services. 

N^anmoins  lorsqu’il  est  question  d’abolir  la  barriere  linguistique 
c’est  bien  le  traducteur  qui  est  le  per sonnage-c 1 A .  Ces  probl&mes 
meritent  des  lors  une  etude  detailiee. 

Plus  que  jamais  le  traducteur  se  trouve  confronts  A  des  textes 
sc ient if iques  tr£s  specialises  et  e  des  descriptions  de  haute  tech- 
nicite.  II  est  oblige  de  faire  face  aux  probiemes 
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-  de  comprehension  des  textes; 

-  de  recherche  ter m i no  log i que  pour  des  techniques  de  pointe; 

-  de  recherche  des  sources  d’ information; 

-  d’adaptation  aux  methodes  modernes  de  travail  et  de  recherche  de 
1 ’ i nformat ion . 


La  comprehension  des  textes  est  rendue  difficile  par  la  complexity 
des  sujets  qui  pour  55*/.  reinvent  du  domaine  des  sciences  precises  et 
de  la  haute  technologie;  la  spec ial isat ion  tres  poussee  des  disci¬ 
plines  impliquees  dans  chaque  sujet  exigent  une  certaine  formation 
scientifique  ou  tout  au  moins  un  contact  etroit  avec  le  spec ial iste. 

La  recherche  des  termes  equivalents  dans  la  langue  source  et  dans  la 
langue  cible  est  rendue  extr&mement  difficile  £  cause  de  la  mouvance 
de  la  terminologies  du  manque  de  standardisation  dans  les  definitions 
de  nouveaux  produits  et  procedes,  des  jargons  industriels  :  le  tra- 
ducteur  consacre  en  moyenne  17*/.  de  son  temps  k  la  recherche  d  ’  infor¬ 
mation  et  58*/.  seulement  k  la  traduction  proprement  dite. 

Les  d ict ionnaires  -  outils  tr ad i t i onne 1 s  du  traducteur  -  presentent 
un  decalage  considerable  par  rapport  aux  textes  proposes,  surtout 
dans  les  domaines  mentionn^s  et  dans  les  disciplines  de  pointe. 

L’essence  m@me  du  dictionnaire  et  les  methodes  d’edition  ne  permet- 
tent  pas  une  mise  k  jour  suffisamment  rapide;  on  estime  generalement 
qu’ils  ont  4  ans  de  retard  sur  1 ’evolution  des  techniques. 

L’adaptation  aux  nouvelles  techniques  est  difficile,  tributaire  d’une 
information  quas i - i nex i stante  dans  le  monde  de  la  traduction,  d’ap- 
prentissages  longs  et  d ’ invest i ssements  financiers  consequents. 


II  faut  se  rendre  k  1 ’evidence  que  malgres  ces  reserves,  seule  la 
technique  des  ordinateurs  peut  k  la  longue  resoudre  tous  ces  pro- 
blames.  C’est  k  cette  technique  qu’ont  eu  recours  les  documental istes 
et  les  special istes  de  1 ’information;  c’est  vers  elle  que  doivent  se 
tourner  desormais  les  traducteurs.  Pour  cette  raison  il  convient  de 
les  en  informer  le  plus  largement  possible. 


La  traduction  assistee  par  ordinateur  est  un 
plusieurs  aspects  de  1 ’ informat isat ion  et  qui 
etapes  du  processus  de  traduction. 


concept  qui  reprend 
s’applique  k  plusieurs 


L ’ ut i 1 isat i on  du  PC  (personal  computer) 


permet  au  traducteur  de  : 


-  beneficier  de  tous  les  avantages  d’un  traitement  de  texte; 

-  traiter  le  texte  source  et  le  texte  cible  en  les  visualisant 
simul tanement  sur  l’ecran; 


-  avoir  des  logiciels  possedant  plusieurs  jeux  de  carractferes  (latin, 
cyrillique,  grec,  hebreu,  etc.)  et  utiliser  simul tanement  plusieurs 
d’entre  eux ,  par  exemple  un  texte  en  cyrillique  avec  denominations 
latines  pour  un  sujet  medical; 

-  #tre  raccorde  par  modem  aux  serveurs  de  son  choix  pour  obtenir 
1 ’ informat  ion  ter m i no  1 og ique  et  scientifique  necessaire; 

-  constituer  ou  acqu^rir  des  glossaires  specialises; 

-  utiliser  des  logiciels  d’aide  k  la  traduction. 


Actuel  lement  51*/.  des  traducteurs  utilisent  un  equipement  informatise, 
mais  la  majority  1 ’utilisent  presque  uniquement  pour  le  traitement  de 
textes  et  les  tdches  administrat ives  auxiliaires. 
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Les  bases  de  donn^es  termi no  log iques ,  autre  aspect  de  1 ’informatisa¬ 
tion  sont  susceptibles  de  fournir  une  aide  pr^cieuse  aux  traducteurs. 
Malheureusement  seuls  14*/.  d’entre  eux  utilisent  des  bases  de  donn^es 
terminologiques  contre  91'/.  qui  continuent  A  se  servir  de  diction- 
naires  specialises  tradi t ionnels . 

Pourtant  les  banques  de  donnees  terminologiques  sont  simplement  des 
d ic t ionna i r es  bilingues,  parfois  mu  1 1 i 1 i ngues ,  parfois  encyclopedi- 
ques,  differents  des  d i c t i onna i r es  tr ad i t i onne 1 s  par  le  fait  qu’ils 
emploient  un  support  magnetique  en  lieu  et  place  du  support  papier. 
Ils  sont  done  memorisables  et  peuvent  etre  manipuies  par  ordinateur. 
Ce  sont  des  d i c t i onna i res  automatiques  et  ils  possedent  tous  les 
avantages  des  enreg i str ements  pour  ordinateurs  :  maniability,  rapidi- 
te  de  reponse.  facilite  d ’ introduction  de  nouvelles  donnees.  C’est 
dire  que  ce  genre  de  dictionnaire  permet  d’obtenir  la  reponse  y  une 
recherche  termino log ique  en  quelques  secondes,  ma i s  aussi  qu’ils 
peuvent  etre  regu 1 ierement  completes  et  mis  y  jour  de  fa?on  perma- 
nente.  II  est  evident  que  ce  nouveau  type  de  dictionnaire  sera  tres 
utile  pour  les  traducteurs. 

Actuel lement  il  n’y  a  pas  encore  beaucoup  de  banques  de  donnees 
terminologiques;  cel les  qui  existent  ne  comportent  pas  encore  assez 
de  termes  pour  couvrir  toutes  les  langues  et  tous  les  domaines  des 
sciences.  mais  m£me  telles  qu’elles  se  pr^sententi  el les  ont  dejA 
rendu  bon  nombre  de  services  aux  traducteurs. 

II  convient  de  citer  en  exemple  le  volume  de  donnees  des  principales 
d ’ entres  el les  : 

-  TEAM,  compi 16e  par  la  firme  Siemens,  est  une  banque  de  donnees 
multilingue  qui  contient  dyjy  plus  de  500.000  termes; 

-  le  dictionnaire  multilingue  encyc 1 op6d i que  EUROD I CAUTOM ,  produit  par 
le  service  termi no  1 og i que  de  la  CEE,  comporte  pr£s  de  350.000  termes 

-  la  banque  termino log ique  mu  1 1 i d i sc  ip  1 i na i r e  du  gouvernement  canadien 

TERM1UM  est  une  banque  de  donnees  bilingue  f r anga i s-ang 1 ai s , 

anglais-frangais;  son  volume  atteint  les  B00 . 000  termes. 

Ces  exemples  montrent  que  si  ces  banques  terminologiques  ne  sont  pas 
encore  exhaustives,  le  nombre  de  termes  enregistr£s  les  rend  dyjy 
exploitables  avec  un  taux  de  r£ponses  positives  sat i sf a i sant . 

En  tout  cas  leurs  avantages  sur  les  d i c t i onna i res  tr ad i t ionnel s  sont 
i ncont es tab  1 es  :  maniability,  rapidity  de  reponse,  mise  A  jour  perma- 
nente.  Deux  facteurs  importants  militent  aussi  en  leur  faveur  :  la 

possibility  de  dialogue  en  direct  et  la  gratuity  (dans  plusieurs  cas) 
due  y  une  exploitation  au  stade  expyrimental  et  pr omo t i onne 1 . 

On  ass i ste  auss i  y  une  affluence  sur  le  mar chy  de  glossaires  plus 
spycialisys,  plus  petits,  y  choisir  par  le  traducteur  en  fonction  de 
ses  besoins  et  y  exploiter  sur  son  propre  PC. 

Les  bases  de  donnees  b i b 1 i ogr aph i ques  reprysentent  par  dyfinition  une 
source  termi no  log ique  inypuisable  et  actualisye;  el les  donnent  sou- 
vent  satisfaction  iy  Du  les  mythodes  tr ad i t i onnel 1 es  et  les  banques 
de  donnees  terminologiques  n’ont  pas  fourni  1 ’ informat  ion  recherch^e. 
Ces  bases  de  donnees  rypertorient  la  quas i -tota 1 i ty  des  publications 
dans  tous  les  domaines  des  conna i ssances  :  cela  signifie  que  toutes 
les  langues  y  sont  r eprysentyes . 

II  y  a  deux  faQons  d’utiliser  les  bases  de  donnyes  pour  des  recher- 
ches  terminologiques. 
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1.  Lorsque  la  base  de  donnyes  el le-m&me  est  bilingue  ou  multilingue, 
il  est  possible  d’obtenir  arte  r^ponse  directement.  En  effet,  dans  ce 
cas»  les  titres  et  les  mots-cl^s  y  f igurent  dans  toutes  les  langues 
utilisyes.  On  peut  c i ter  comme  exemple  la  base  de  donn^es  multidisci- 
plinaire  PASCAL  qui  est  multilingue  ( f ranga i s-ang 1  a i s-a 1 1 emand-espa- 
gnol).  Ilya  aussi  la  base  de  donnyes  TITUS  (textile)  qui  est 
propos^e  en  trois  versions  :  fran?aise,  anglaise  et  espagnole  et  qui 
comporte  done  directement  la  terminologie  de  l’industrie  du  textile 
en  trois  langues.  Cette  premiere  fa<?on  d’utiliser  les  bases  de  don¬ 
nyes  permet  d’obtenir  les  termes  recherch^s  dans  les  langues  de  la 
traduction  directement  lors  de  la  consultation. 

2.  La  deuxiyme  alternative  concerne  les  langues  qui  ne  sont  pas 
utilises  lors  de  la  production  des  bases  de  donnyes  :  par  exemple, 
le  russe  ou  le  japonais.  Dans  ce  cas  la  solution  consiste  A  syiec- 
tionner  les  ouvrages  qui  traitent  le  sujet  soumis  au  traducteur  dans 
la  langue  originale  et  dans  la  langue  cible.  A  la  difference  d’un 
traducteur  qui  essaye  de  se  documenter  sans  donn^es  de  depart  bien 
determines,  l’ordinateur  fournit  i  mmed  i  atement  les  references  pre¬ 
cises  et  pertinentes  des  ouvrages  traitant  le  sujet  presentant  la 
terminologie  recherchye. 

Les  avantages  de  cette  approche  par  rapport  aux  methodes  tradition- 
nelles  qu’utilisent  les  traducteurs  sont  evidents  :  rapidity,  perti¬ 
nence  des  reponses  et,  en  plus,  possibility  de  commander  rapidement 
les  ouvrages  select ionnes  (par  le  m@me  ordinateur)  dans  le  cas  ou  les 
b ib 1 iothyques  les  plus  proches  ne  les  posseder a i ent  pas.  Un  avantage 
suppiementaire  non  nygligeable  :  les  ouvrages  signaies  par  les  bases 
de  donnees  sont  recents,  ecr its  par  des  special istes  dans  leur  lan¬ 
gue;  cela  permet  au  traducteur  d’obtenir  la  terminologie  usuelle, 
prise  dans  son  contexte,  au  lieu  de  celle  des  d i c t i onna i r es ,  qui  est 
parfois  artificielle  et  inappropr i£e . 

En  plus  par  1 ’ i nter m£d i a i r e  de  divers  centres  d’ information  ou  bu¬ 
reaux  special ises  les  banques  de  donnees  ter m i no  1 og i ques  et  les  bases 
de  donnees  b ib 1 iograph iques  sont  accessibles  aux  interesses  m£me 
s’ils  ne  sont  pas  encore  yquipys  en  outils  i nf or mat i ses . 

La  comparaison  des  i nvest i ssements  en  temps  de  recherche  et  en  dic- 
tionnaires  et  des  invest issements  eventuels  en  recherche  par  ordina¬ 
teur  est  nettement  favorable  h  la  mythode  nouvelle. 

CONCLUSIONS 

La  reduction  des  difficultys  1 inguist iques  comporte  deux  aspects  : 

-  la  solution  des  probiymes  du  consommateur  de  1 ’ i nf or mat i on , 

-  1 ’ i nf or mat i on  et  l’assistance  du  traducteur. 

L’abolition  des  barrieres  1 i ngu i s t i ques  est  ytroitement  1 iye  y  1 ’or¬ 
ganisation  de  1 ’information  scientifique  et  technique  et  rencontre 
les  m^mes  probiymes  :  dypistage  et  compilation  des  documents,  stan¬ 
dardisation  term i no  log ique . 

Les  progrys  de  1 ’ informat  ion  informatis^e  diminue  les  barrieres  lin- 
guistiques  en  imposant  une  seule  langue  vyhiculaire  et  une  standardi¬ 
sation  termino log ique . 

L’examen  des  probiymes  1 inguist iques  montre  que  1 ’approche  du  pro¬ 
blems  est  diffyrente  pour  chaque  communauty  linguistique  et  pour 
chaque  groupe  de  langues. 
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II  est  cependant  Evident  qu’il  faut  mettre  en  oeuvre*  signaler  aux 
interesses  et  utiliser  tous  les  moyens  visant  A  faciliter  la  compre¬ 
hension  :  cover-to-cover  translations*  BD  de  traductions,  etc.  II  est 
tout  aussi  important  d’atteler  £  cette  tSche  les  producteurs  et 
d istr ibuteurs  d’ information  que  les  traducteurs. 

II  apparalt  que  si  les  probiemes  du  traducteur  sont  complexes,  les 
methodes  modernes,  1 ’ i nf or mat i sat  ion  et  surtout  une  bonne  informa¬ 
tion  des  traducteurs  sont  susceptibles  de  les  r^soudre.  La  creation 
d’outils  nouveaux  pour  les  traducteurs  et  1 ’ut i 1 isat ion  rationelle  de 
1 ’information  mise  &  leur  disposition  est  inevitable  pour  reactuali- 
ser  le  metier  de  la  traduction. 

La  coordination  de  tous  les  efforts  peut  assurer  la  realisation  de 
1 ’ensemble  des  mesures  qui  permettrons  une  meilleure  comprehension 
de  1  ’  inf  ormat  ion  par  les  spec  ial  istes  ,  la  creation  et  ^implementa¬ 
tion  d’outils  appropries  pour  faciliter  le  processus  de  traduction. 
Les  barrieres  1 inguist iques  s ’ en  trouveront  cons ider ab lement  re- 
du i tes . 
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\  SUMMARY 

4l 

Linguistic  problems  are  in  close  connection  with  information  problems.  They  therefore  are  examined  jointly  in  order  to  solve 
the  terminological  problems  and  to  reach  the  documentary  mass.  The  problems  of  comprehension  of  the  texts  by  the  specialist 
are  examined  as  well  as  the  difficulties  raised  by  the  translator.  Linguistic  problems  are  examined  according  to  the 
characteristics  of  the  unilingual  and  multilingual  countries  and  according  to  the  groups  of  languages.  The  existing  means  to 
reduce  linguistic  barrier  and  implied  institutions  are  considered  from  the  point  of  view  of  the  user  and  of  the  translator.  The 
translator  is  the  keyman  to  overcome  linguistic  barrier.  He  is  confronted  with  the  problems  of  scientific  and  technical  text 
comprehension,  of  search  for  information  and  technical  terms  sources,  of  adaptation  to  the  modern  work  methods.  The  use  of 
data  processing  provides  him  some  help  and  multiple  possibilities  of  rationalization  of  the  translation  process.  Among  the 
known  methods  of  translations,  computer-assisted  translation  seems  currently  the  most  performing:  use  of  a  PC  for  translation 
process  as  such,  use  of  computerized  dictionaries  and  bibliographical  data  bases  as  multilingual  terminology  sources  reliable 
and  up-to-date.  The  whole  of  raised  measures  makes  it  possible  to  reduce  linguistic  difficulties  of  the  information  user  and  to 
ensure  an  effective  help  to  the  translator. 

<\ 

1.  Linguistic  barriers  and  modern  scientific  and  technical  information. 


1 .  Problems  to  provide  full-text  documents  and  to  get  information 

a)  access  to  documents; 

b)  communication  with  the  information  owners. 

2.  Linguistic  problems 

a)  linguistic  problems  tn  unilingual  and  multilingual  countries; 

b)  problems  with  difficult  accessible  languages. 

II.  Means  for  language  barrier  reduction. 

1 .  Cover-to-cover  translation; 

2.  Use  of  the  existing  translations; 

a)  collection  of  translations: 

b)  DATA  BASE  production  for  existing  translations: 

c)  problems  to  find  and  to  get  translations. 

.1.  Translation 

a)  human  translation; 

b)  machine  translation; 

c)  computer-assisted  translation. 

III.  Institutions  and  translators  providing  help  to  reduce  the  linguistic  barrier. 

1 .  Information  and  documentation  centres. 

a)  dispatcher  targets; 

b)  cooperation  for  DATA  BASE  production: 

c)  terminological  help. 

2.  Translator  and  problems  of  translation. 

a)  text  comprehension  with  high  technicality: 

b)  terminological  problems; 

c)  dictionaries  and  glossaries. 

IV.  Translator  and  computer-assisted  translation. 

1 .  Modernisation  of  the  current  translation  methods, 

a)  word  processing; 

b)  multiple-fonts  printers; 

c)  software  for  translation  help. 
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2.  Terminological  DATA  BASES. 

a)  Eurodicautom; 

b)  Termium; 

c)  Computerised  glossaries  and  dictionaries. 

3.  Use  of  bibliographical  DATA  BASES  as  up-to-date  terminological  sources. 

a)  Multilingual  DATA  BASES;  direct  use  of  the  thesaurus; 

b)  Bibliographical  DATA  BASE  with  selection  by  subject  and  by  language. 

V.  Conclusions. 

1 .  Solution  of  the  information  users  linguistic  problems;. 

2.  Information  and  help  to  the  translator. 

KEYWORDS 

Bibliographical  DATA  BASE, 

Translation  DATA  BASE, 

Terminological  DATA  BASE, 

Cover-to-cover  translations, 

Machine-readable  dictionaries  and  glossaries, 

Eurodicautom, 

Multiple-fonts  printers. 

Difficult  accessible  languages. 

Software  for  translation  help, 

Microprocessor, 

Scientific  and  technical  terminology, 

Termium, 

Thesaurus, 

Computer-assisted  translation. 

Machine  translation, 

Human  translation. 

Word  processing. 

In  scientific  and  technical  literature,  especially  in  the  fields  of  high  technology,  the  terminology  is  moving.  Research,  the 
industrial  sector  and  the  commercial  sector  are  confronted  with  a  new  discoveries,  new  products,  therefore  with  new  names. 
The  lack  of  terminological  standardization  and  the  use  of  industrial  jargons  make  comprehension  even  more  difficult  not  only 
for  foreign  languages,  but  also  for  the  users  who  practice  the  document  language. 

Information  science,  created  to  specialists,  tends  to  cover  exhaustively  the  scientific  and  technical  literature  available  in  the 
world.  The  production  of  bases  and  computerized  data  banks  has  become  practical  current  and  is  in  fact  the  only  solution  of 
information  in  relation  to  the  increasing  document  mass. 

Two  kinds  of  bases  and  data  banks  are  distinguished:  those  of  universal  utility  (CHEM  ABS,  INSPEC.  METADEX.  NTIS.  etc) 
and  those  worked  out  for  a  more  restricted  use,  for  a  specific  industry,  several  firms  or  even  only  one  institution.  But  whatever 
they  are,  producers  and  users  come  up  against  language  and  terminology  problems. 

Terminological  standardization  itself,  as  well  as  the  production  of  data  bases,  is  closely  dependent  on  the  communication 
between  the  document  producers,  the  availability  and  announcement  of  these  documents,  and  the  organization  of  coherent 
information. 

The  availability  of  documents  is  problematic  for  well-known  reasons: 

—  industrial  competition,  safety,  reason  of  state; 

—  difficulty  to  find  the  documents'  producers  and  owners  and  get  in  touch  with  them. 

A  data  base  compilation  constitutes  a  first  step  towards  a  better  communication  and  towards  terminological  standardization; 
indeed,  the  producer  of  a  data  base  contacts  the  documents’  writers,  gathers  the  documents  or  their  annoucement.  indexes 
them  in  a  uniform  way,  thus  creating  a  more  or  less  homogeneous  terminology.  In  return,  the  users  of  the  data  bases  get  used  to 
only  one  terminology  and  tend  to  use  it.  to  use  it. 

The  universal  use  of  the  data  bases  by  users  of  various  countries  and  of  various  languages  makes  all  of  them  used  to  only  one 
language  and  only  one  terminology  (in  majority  English)  and  already  breaks  through  the  wall  of  linguistic  incomprehension. 

It  is  advisable  here  to  stress  that  the  linguistic  problems  are  different  from  a  country  to  another;  there  are  unilingual  and 
multilingual  countries. 
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Languages  also  can  be  divided  into  two  categories:  easy  said  languages:  Germanic  and  Latin  languages,  and  languages  that  one 
describes  as  difficult  accessible:  Slavic,  Chinese,  Japanese.  Arab  languages,  etc);  of  these  last  only  will  be  retained  those  which 
convey  currently  a  significant  scientific  production,  such  as  for  example  Japanese  and  Russian. 

When  one  differentiates  the  problems  by  country,  one  points  out  that  the  unilingual  countries  such  as  France.  Great  Britain,  the 
USA  have  problems  with  ajl  languages;  but  if  they  have  difficulties  of  comprehension,  on  the  other  hand  they  have  set  up 
structures  for  a  long  time  and  created  organisms  facilitating  the  translation  for  languages  of  both  groups.  The  figures  are 
significant:  29%  of  all  the  translations  carried  out  concern  the  pairs  of  French-English  and  German-English  languages 

The  multilingual  countries  such  as  Switzerland.  Canada.  Belgium  have  much  less  comprehension  problem  of  the  original  texts 
and  have  consequently  a  weaker  network  of  scientific  translation,  centred  especially  on  difficult  accessible  languages. 

Language  barrier  reduction  does  not  mean  to  translate  systematically  the  texts  in  the  language  of  the  user.  When  one  speaks 
about  network  translation  and  structure  facilitating  translation,  very  few  realize  its  width  and  diversity.  However  it  is  essential 
that  information's  users  and  specialists  know  them. 

There  are  initially  the  full-translations  of  documents  (cover-to-cover  translations)  that  one  publishes  especially  for  the  difficult 
accessible  languages  (Russian.  Japanese).  They  are  also  announced  by  reference  works. 

There  is  also  a  considerable  number  of  texts  which  are  already  translated  and  can  be  re-used  several  times  at  a  later  date.  For 
that,  one  should  detect  them,  collect  them  and  compile  them  in  the  form  of  reference  works  or  data  banks  to  announce  them  to 
potential  users.  The  translated  document  collection  comes  up  against  the  already  raised  difficulties  of  competition  and  safety. 

In  these  particular  cases  have  to  intervene  more  restricted  groupings  to  guarantee  control  of  the  users,  defence  of  the  interests 
of  each  one  inside  the  grouping  as  well  as  professional  secrecy  for  the  translation  suppliers  and  their  users.  It  is  only  in  this 
context,  by  respecting  the  raised  factors  that  it  is  possible  to  gather  the  translations  already  carried  out. 

It  goes  without  saying  that  the  data  base  is  the  most  valid  solution  to  put  these  translations  at  the  disposal  of  the  interested 
parties.  For  example  it  is  advisable  to  quote  "World  Translation  Index",  data  base  which  proposes  the  translations  gathered  in 
Europe  and  in  the  USA  by  the  National  Translation  Centre  (USA)  and  the  International  Translation  Centre  (Europe).  WTI  is 
published  also  in  the  form  of  a  traditional  reference  work. 

Certain  groupings  point  out  for  their  members  the  translations  carried  out  within  the  group,  for  example  the  Metallurgical 
Research  Centre.  Belgium. 

As  a  last  resort  to  overcome  linguistic  barrier,  there  is  obviously  the  translation.  One  distinguishes: 

—  human  translation. 

—  machine  translation; 

—  computer-assisted  translation. 

The  traditional  translation  methods  requires  translators  and  raises  certain  difficulties,  especially  for  the  very  specialized 
technical  texts:  the  translator  encounters  problems  of  technical  comprehension  and  terminology. 

Computer-translation  made  major  progress,  but  its  effectiveness  and  quality  of  translation  is  still  debatable.  Multiple  studies 
spoke  about  it.  evaluating  its  advantages  and  its  defects.  It  is  considered  however  that  these  translations  are  valid  at  least  for 
scientists'  information  and  provided  that  the  vocabulary  input  into  the  computer  is  limited  to  a  properly  determined  field.  It  is 
useful  to  stress  the  positive  result  obtained  for  documents  indexingation  and  preparation  of  summaries  to  be  introduced  in  d-?a 
bases.  Standard  or  repetitive  texts  are  also  rather  well  translated.  The  principal  advantage  of  computer-translation  is 
indisputably  the  possibility  of  terminological  standardization.  Standardisation  phenomenon  caused  by  data  bases  for  one 
language  appears  here  for  several  languages.  Unfortunately  the  essential  reproaches  that  are  generally  formulated  are:  price  too 
high,  considerable  waste  of  time  in  post-edition,  computer  inadpitude  to  take  into  consideration  text  nuances. 

Computer-assisted  translation,  as  for  it  more  flexible,  combines  translator's  creative  work  and  data  processing  possibilities. 
That  is  why  it  presents  an  advantageous,  valid,  realizable  solution  without  excessive  expenses.  Its  advantages  are  obvious  for 
texts'  user  and  translator;  translation  output  and  quality  are  improved. 

It  is  noted  that  the  possibilities  of  linguistic  barrier  reduction  are  numerous;  it  will  be  therefore  useful  to  know  the  institutions 
which  implement  them. 

There  are  above  all  translation  and  diffusion  organisms  of  the  bases  and  data  banks: 

—  base  producers  and  data  banks; 

—  information  centres; 

—  translation  announcement  centres; 

—  terminological  centres. 

Organisms  contact  the  document  producers,  compile  literature,  put  information  at  user  s  disposal,  gather  the  existing 
translations,  produce  and  exploit  computerized  terminological  dictionaries.  They  are  useful  to  the  user  since  they  answer  in 
various  ways  their  information  requirements  and  overcome  linguistic  barrier.  They  are  also  the  translator's  auxiliaries  since 
they  are  implied  in  the  production  and  exploit  ttion  of  the  terminological  and  information  sources  on  the  existing  translations. 


The  information  user  confronted  with  linguistic  problems  and  the  translator  who  has  to  help  him  have  both  a  major  interest  to 
know  them  and  to  resort  to  their  services. 

Nevertheless,  when  it  is  question  of  overcoming  barrier  linguistic,  the  translator  is  actually  the  keyman.  These  problems 
deserve  consequently  a  detailed  study. 

More  than  ever  the  translator  finds  himself  confronted  with  very  specialized  scientific  texts  and  with  high  technicality 
descriptions.  He  is  compelled  to  cope  with  the  following  problems: 

—  comprehension  of  the  texts; 

—  terminological  research  for  advanced  techniques; 

—  research  of  information  sources; 

—  adaptation  to  modem  work  methods  and  information  retrieval. 

Texts  comprehension  is  made  difficult  by  the  complexity  of  subjects  which  for  55  %  are  relevant  to  precise  sciences  field  and 
high  technology;  the  vti y  high  specialization  of  the  disciplines  implied  in  each  subject  requires  a  certain  scientific  training  or  at 
least  a  close  contact  with  the  specialist. 

Research  of  the  equivalent  terms  in  source  and  target  language  is  made  extremely  difficult  because  of  terminology  changes,  lack 
of  standardization  in  defining  new  products  and  methods,  industrial  jargons:  the  translator  devotes  on  average  1 7  %  of  his  time 
in  information  retrieval  and  58  %  only  to  the  translation  as  such. 

Dictionaries  —  the  traditional  translator’s  tools  —  present  a  considerable  shift  in  relation  to  the  proposed  texts,  especially  in  the 
fields  mentioned  and  in  advanced  disciplines. 

The  very  purpose  of  a  dictionary  and  edition  methods  does  not  allow  any  sufficiently  rapid  update;  it  is  generally  considered 
that  they  are  4  years  late  on  technological  developments. 

The  adaptation  to  the  new  techniques  is  difficult,  dependent  on  almost  not-existent  information  in  the  world  of  translation,  long 
trainings  and  consequent  financial  investments. 

One  should  admit  that  in  spite  of  these  reserves,  only  computer  technique  can  at  last  solve  all  these  problems.  It  is  to  this 
technique  that  had  resorted  documentalists  and  information  specialists:  translators  have  to  consider  it  now.  For  this  reason  it  is 
advisable  to  inform  them  the  most  widely  possible. 

Computer-assisted  translation  is  a  concept  which  takes  again  several  aspects  of  computerization  and  which  applies  at  several 
stages  of  the  translation  process. 

The  use  of  the  PC  (personal  computer)  allows  the  translator: 

—  to  benefit  from  all  the  advantages  of  text  processing; 

—  to  process  the  source  and  target  text  in  visualizing  them  simultaneously  on  the  screen; 

—  to  have  software  having  several  character  sets  (Latin,  Cyrillic,  Greek,  Hebrew,  etc)  and  to  use  simultaneously  several  of 
them,  for  example  a  text  in  Cyrillic  with  Latin  denominations  for  a  medical  subject; 

—  to  be  connected  by  host  modem  of  his  choice  to  obtain  necessay  terminological  and  scientific  information; 

—  to  constitute  or  acquire  specialized  glossaries; 

—  to  use  software  for  translation  help. 

Currently  5 1  %  of  the  translators  use  computerized  equipment,  but  a  majority  of  them  uses  it  quite  only  for  text  processing  and 
auxiliary  administrative  tasks. 

Terminological  data  bases  which  are  another  aspect  of  computerization  are  likely  to  provide  precious  help  for  translators. 
Unfortunately,  only  14  %  of  transaltors  use  terminological  data  bases  against  91  %  who  continue  making  use  of  traditional 
specialized  dictionaries. 

However,  terminological  data  banks  are  simply  dictionaries  bilingual,  sometimes  multilingual,  sometimes  encyclopaedic, 
different  from  the  traditional  dictionaries  by  the  fact  that  they  employ  a  magnetic  medium  instead  of  paper  medium.  They  are 
therefore  memorized  and  can  be  handled  by  computer.  They  are  automatic  dictionaries  and  they  have  all  the  advantages  of  the 
recordings  for  computers:  answer  handiness,  speed,  input  facility  of  new  data.  It  is  to  tell  that  this  kind  of  dictionary  makes  it 
possible  to  obtain  the  response  to  terminological  research  in  a  few  seconds,  but  also  that  they  can  regularly  be  supplemented 
and  updated  in  a  permanent  way.  It  is  obvious  that  this  new  type  of  dictionary  will  be  very  useful  for  the  translators. 

Currently  there  are  not  yet  many  terminological  data  banks;  those  which  exist  do  not  yet  comprise  enough  terms  to  cover  all 
languages  and  all  the  fields  of  sciences,  but  even  as  they  are.  they  have  been  already  very  useful  to  the  translators. 

It  is  advisable  to  mention  data  size  of  the  most  important  data  banks: 
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—  TEAM,  compiled  by  the  firm  Siemens,  is  a  multilingual  data  bank  which  contains  already  more  than  500.1)110  terms 

—  the  multilingual  encyclopaedic  dictionary  EUROD1CAUTOM,  produced  by  the  terminological  service  of  the  EEC 
comprises  nearly  350.000  terms; 

—  the  multidisciplinary  terminology  data  bank  of  the  Canadian  government  TERMICM  is  a  bilingual  French-English, 
English-French  data  bank;  its  volume  reaches  the  800,000  terms 

These  examples  show  that  if  these  terminological  data  banks  are  not  yet  exhaustive,  the  number  of  recorded  terms  makes  them 
workable  already  with  a  satisfying  positive  answer  rate. 

In  any  case  their  advantages  over  traditional  dictionaries  are  undeniable:  answer  handiness,  speed,  permanent  update.  Two 
important  factors  militate  also  in  their  favour:  possibility  of  on-line  dialogue  and  free  of  charge  fin  several  cases)  due  to  an 
exploitation  at  an  experimental  and  promotional  stage. 

One  attends  also  to  a  multitude  of  more  specialized  and  smaller  glossaries  on  the  market,  to  be  chosen  by  the  translator 
according  to  his  needs  and  to  to  be  exploited  on  its  own  PC. 

The  bibliographical  data  bases  represent  by  definition  an  inexhaustible  and  update  terminological  source;  they  often  give 
satisfaction  where  traditional  methods  and  terminological  data  banks  did  not  provide  required  information.  These  data  bases 
index  almost  all  the  publications  in  all  the  fields  of  knowledge:  that  means  that  all  languages  are  represented  there.  There  are 
two  way  s  of  using  the  data  bases  for  terminological  research. 

1.  When  the  data  base  itself  is  bilingual  or  multilingual,  it  is  possible  to  obtain  an  answer  directly  .  Indeed,  in  this  case,  the  titles 
and  the  key  words  appear  in  it  in  all  languages  used.  One  can  quote  as  an  example  the  multidisciplinary  data  base  PASCAL 
which  is  multilingual  (French-English-German-Spanish).  There  is  also  the  data  base  TITUS  (textile)  proposed  in  three 
versions;  French,  English  and  Spanish  and  containing  therefore  directly  the  terminology  of  textile  industry  in  the  three 
languages.  This  first  way  of  using  data  bases  makes  it  possible  to  obtain  the  required  terms  in  languages  of  the  translation 
directly  during  consultation. 

Z.  The  second  alternative  concerns  languages  which  are  not  used  during  data  bases  production:  for  example.  Russian  or 
Japanese.  In  this  case  the  solution  consists  in  selecting  the  documents  which  cover  the  subject  submitted  to  the  translator  in 
source  and  target  language.  Contrary  to  the  translator  who  tries  to  document  himself  without  well  defined  starting  data,  the 
computer  provides  immediately  the  precise  and  relevant  references  of  the  documents  covering  the  subject  presenting  the 
required  terminology. 

The  advantages  of  this  approach  in  relation  to  the  translator's  traditional  methods  are  obvious;  speed,  relevance  of  the  answers 
and,  in  addition,  possibility  of  ordering  quickly  the  documents  selected  (by  the  same  computer)  if  the  closest  libraries  did  not 
have  them.  Considerable  additional  advantage:  the  documents  announced  by  the  data  bases  are  recent,  w  ritten  by  specialists  in 
their  language;  that  enables  the  translator  to  obtain  usual  terminology,  taken  in  its  context,  instead  of  that  of  the  dictionaries, 
wnich  is  sometimes  artificial  and  inappropriate. 

Moreover,  via  various  information  centres  or  specialized  offices,  terminological  data  banks  and  bibliographical  data  bases  are 
even  accessible  to  the  interested  parties  if  they  are  not  equipped  yet  in  computerized  tools 

The  comparison  between  investments  in  research  time  and  dictionaries  and  possible  investments  in  computer  research  is 
definitely  favourable  to  the  new  method. 

CONCLUSIONS 

Linguistic  difficulties  reduction  has  two  aspects: 

—  solution  of  the  information  user  problems. 

—  translator's  information  and  assistance. 

Linguistic  barriers  reduction  is  closely  dependent  on  the  organization  of  scientific  and  technical  information  and  encounters 
the  same  problems:  search  and  compilation  of  the  documents,  terminological  standardization. 

Progress  of  computerized  information  reduces  linguistic  barriers  by  imposing  only  one  common  language  and  terminological 
standardization. 

Linguistic  problems  study  shows  that  the  approach  of  the  problem  is  different  for  each  linguistic  commumtv  and  for  each  group 
of  languages. 

It  is  meanwhile  obvious  that  one  should  implement,  announce  to  the  interested  parties  and  use  all  the  means  aiming  to  facilitate 
comprehension:  cover-to-cover  translation,  DATA  BASE  translations,  etc.  It  is  as  important  that  information  producers  and 
providers  as  translators  undertake  this  task. 

It  appears  that  if  the  translator's  problems  are  complex,  the  modern  methods,  translators'  computerization  and  especially  good 
information  are  likely  to  solve  them.  The  creation  of  new  tools  for  the  translators  and  the  rational  use  of  the  information  placed 
at  their  disposal  is  inevitable  to  bring  up  to  date  translation  profession. 


Coordination  of  all  the  efforts  can  ensure  to  achieve  the  whole  measures  w  hich  w  ill  allow  better  comprehension  ot  information 
by  specialists,  creation  and  implementation  of  tools  adapted  to  facilitate  the  translation  process.  Linguistic  barriers  will  be 
considerably  reduced. 
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INTRODUCTION 

In  “this  paper,  I  will  try  to^show^that  effective  performance  cannot  be  assured  simply  by  installing 
expensive,  state  of  the  art  technology.  \,  Qu*  studies  of  many  applications  of  computing  and  information 
technology  have  included  both  successful  and  unsuccessful  examples The  clear  practical  lessoa  from 
this  work  is  that  the  successful  cases  have  usually  been  those  there  technical  change  has  been 
accompanied  by  appropriate  organizational  change.  The  less  successful  ones  have  generally  been  those 
where  projects  have  been  dominated  by  technical  considerations,  with  little  or  no  thought  given  to 
organizational  ones. 

These  lessons  seem  as  relevant  to  defence  installations  as  to  the  civilian  ones  in  which  our  research  has 
been  conducted.  The  common  characteristic  is  that  all  organizations  are  confronted  by  technical 
developments,  which  are  intended  to  help  them  meet  their  objectives.  For  them  to  succeed  in  that  task, 
it  appears  to  us  that  management  needs  to  ensure  that  three  big  hurdles  are  successfully  crossed.  These 
are: 


Managing  the  project 
Setting  the  right  objectives 
Changing  the  organization 

Each  of  these  hurdles  will  be  outlined  in  turn,  and  the  paper  concludes  with  some  proposals  on  how  they 
can  be  overcome. 

I* 


MANAGING  THE  PROJECT 
Who* 8  in  charge? 

A  safe  assumption  is  that  the  direction  a  project  takes  will  reflect  the  interests  ar.d  pref^ren^ci  ^f  the 
function  or  group  to  whom  t_p  management  allocate  project  leadership.  A  good  example  of  this  occurred 

when  a  manufacturing  company  set  out  to  re-organize  its  mainframe  computing  system,  involving  a  move  from 
^  a  heavily  centralized  system  to  one  in  which  computing  services  were  provided  at  each  of  their  several 

facilities . 

The  old  system  had,  as  is  usually  the  case,  been  run  by  the  finance  function,  and,  without  a  great  deal 
of  thought,  the  Board  decreed  that  the  project  to  establish  the  new  centralized  facility  would  also  be 
run  by  the  accountants.  The  new  system  worked  adequately:  but  production  and  other  line  managers 

argued  that  they  had  been  able  to  make  very  little  contribution  to  the  design  of  the  system.  The  effect 
was  that  in  designing  the  system  major  opportunities  were  missed  to  include  facilities  which  would  have 
been  of  great  value  to  production  staff  at  no  extra  cost. 

A  similar  debate  can  occur  over  whether  the  project  is  led  by  a  computer  expert  or  a  user.  In  general, 
successful  applications  have  been  those  where  the  users  have  been  in  a  dominant  position  in  the  project 
teams,  with,  of  course,  competent  computer  expertise  available.  A  sure  way  to  get  a  system  that  doesn’t 
meet  the  practical  requirements  of  a  function  is  to  give  too  much  authority  in  shaping  the  project  to  the 
technical  experts. 

Finally,  great  care  needs  to  b*»  taken  to  ensure  that  project  teams  as  a  whole  are  not  only  balanced  in 
terms  of  their  representation  from  Key  areas,  but  also  that  they  possess  adequate  teamworking  skills. 
We  know  of  one  project  which  failed  dismally  because  it  was  entrusted  to  a  project  team  which  was  not 
only  functionally  imbalanced,  but  which  lacked  the  skills  necessary  to  resolve  problems,  take  decisions, 
and  get  action. 


Who  do  they  consult? 

As  a  general  rule  it  is  worth  assuming  (until  it  is  proved  otherwise)  that  the  staff  most  closely 
involved  in  operating  existing  procedures  and  systems  have  the  best  understanding  of  them,  and  of  how  a 
more  advanced  computer  system  -  Id  help.  More  senior  staff,  at  a  distance,  tend  not  to  be  aware  of,  or 
to  have  forgotten,  the  practical  operating  realities  of  daily  work.  If  they  take  too  dominant  a  role  in 
designing  and  selecting  new  systems,  there  is  a  severe  risk  that  they  will  computerize  a  myth,  rather 
than  a  real  operation. 

A  successful  example  of  involving  staff  in  specifying  a  new  Information  system  occurred  in  a  food 
company.  They  had  previously  installed  a  traditional  management  information  system,  producing  huge 
quantities  of  print-out,  which  no-one  used.  A  new  senior  management  decided  to  abandon  this  system,  and 
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to  start  again.  They  set  up  a  series  of  temporary  teams  in  all  major  areas  of  the  factory  consisting  of 
both  operators  and  staff.  Everyone  was  given  the  opportunity  to  think,  through  their  job,  and  to 
indicate  what  information  would  help  them  to  perform  more  effectively.  This  basic  data  was  worked  into 
a  coherent  specification,  which  system  staff  then  used  as  their  target.  The  final  system,  radically 
different  from  the  one  it  replaced,  was  successfully  implemented,  and  continues  to  be  a  daily  help  to 
staff  at  all  levels. 

A  similar  approach  can  be  taken  when  the  decision  concerns  the  evaluation  and  purchase  of  standard 
systems.  Within  obvious  budgetary  or  compatibility  constraints,  experienced  staff  who  wi  1 1  uaing  the 
equipment  will  probably  make  a  better  choice  than  either  their  bosses  or  the  technical  experts,  and  not 
to  Include  them  In  the  decisions  on  what  to  to  buy  Is  to  ask  for  trouble. 


How  to  develop  competence? 

Contrary  to  popular  belief,  sophisticated  equipment  generally  needs  skilled,  trained  and  committed 
staff,  if  it  is  to  be  used  effectively.  Whilst  most  staff  will  be  able  to  cope  with  the  routine 
aspects  of  a  new  system  with  little  or  no  training,  many  of  the  potential  benefits  are  lost  if  deliberate 
steps  are  not  taken  to  develop,  and  to  keep  developing  competence. 

We  have  often  found  companies  which,  having  spent  very  large  sums  on  hardware  and  software,  regard 
training  as  a  cost  to  be  minimised  or  avoided  altogether  if  possible.  This  is  a  false  economy,  which 
many  have  acknowledged  in  their  discussions  with  us:  asked  what  lessons  they  have  learned  from  a 
project,  the  one  most  frequently  cited  is  that  they  would  spend  more  time  and  money  on  training. 
Appropriate  funds  need  to  be  built  into  the  overall  capital  budget,  to  ensure  that  acquiring  competence 
is  seen  as  being  just  as  Important  as  acquiring  technology. 

It  is  also  important  to  ensure  that  people  are  given  time  to  develop  skills.  If  they,  or  their 
managers,  are  put  under  constant  pressure  for  output  above  all  else,  training  time  will  Inevitably  be 
diminished.  Similarly,  reliance  on  other  operators,  also  under  production  pressure,  to  do  the  training 
rarely  works.  Such  people  can  certainly  help  the  training  and  familiarisation  process:  but  they  will 
experience  unreasonable  stress  if  they  are  expected  to  train  other  staff  properly  and  meet  their  own 
output  targets.  Finally,  it  is  important  to  consider  what  new  competencies  are  need  by  those  less 
directly  involved  with  the  new  system,  as  well  as  those  who  are  operating  it. 


SETTING  THE  RIGHT  OBJECTIVES 


An  operating  or  strategic  emphasis? 

Whatever  the  technical  characteristics  of  a  new  application  may  be,  it  is  management  which  establishes 
what  kind  of  contribution  it  will  make  to  the  business.  In  some  cases,  the  emphasis  has  clearly  been  on 
using  the  technology  to  improve  operating  performance  in  the  production  and  delivery  of  current  goods  and 
services.  The  benefits  sought  have  been  those  which  could  be  measured,  and  where  quantifiable 
improvements  over  current  practice  could  be  confidently  expected.  Thus  new  technology  is  used  if  it 
promises  to  reduce  labour  costs,  save  energy,  overcome  bottlenecks  in  production,  reduce  scrap  and  so  on. 
In  the  same  vein,  computer  technology  has  often  been  seen  as  a  way  of  increasing  the  amount  of  control 
which  can  be  exercised  over  operations,  by  allowing  procedures  to  be  specified  more  precisely,  to  reduce 
dependence  on  scarce  skilled  labour,  or  to  ensure  a  more  even  and  regular  pace  of  work. 

Elsewhere,  the  objectives  have  been  different.  Technology  has  been  introduced  to  secure  less  tangible 
benefits,  such  as  improving  the  quality  and  timeliness  of  information  available  to  managers,  with  the 
intention  of  enabling  more  confident  decisions  to  be  made.  In  other  cases  technical  changes  have  been 
used  to  otter  new  or  significantly  enhanced  products  or  services,  and  thus  radically  shift  the  position 
of  the  organization  in  the  eyes  of  its  customers  or  clients. 

We  have  distinguished  these  two  approaches  as  having  either  an  ’operating"  or  a  "strategic"  emphasis 
(2,  3,  4).  Which  approach  is  taken  is  of  vital  significance  to  the  way  the  capabilities  of  IT  are 
used  In  the  organization,  but  is,  by  and  large,  independent  of  the  technology.  The  key  factor  is  senior 
managers'  awareness  of  the  scope  for  strategic  uses  of  emerging  technology,  and  Its  willingness  to  embark 
or.  this  much  more  difficult  approach. 

Adopting  a  strategic  approach  is  a  much  greater  challenge  than  that  of  using  technology  primarily  to 
improve  current  operations.  It  presents  these  responsible  for  a  project  with  new  conceptual  and 
organizational  challenges.  For  example,  managers  need  to  have  a  global  and  consistent  picture  of  the 
enterprise,  and  of  how  technical  developments  can  be  implemented  which  will  radically  change  the  kind  of 
tasks  it  Is  able  to  do,  and  the  services  it  is  able  to  provide. 

Exploiting  this  potential  also  requires  managers  who  are  able  to  conceive  the  operation  of  the 
organization  as  a  whole,  unconstrained  by  established  departmental  or  other  structures.  Most  managers 
are  familiar  with  well-established  lines  of  authority,  and  vertical  information  flows.  Technology  makes 
it  possible  to  move  information,  independently  of  established  structures  -  and  an  ability  to  see  and  value 
this  possibility  Is  essential  if  the  strategic  capabilities  are  to  be  achieved. 
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Does  the  system  fit  the  objectives? 


Even  if  senior  staff  are  aware  of  the  strategic  possibilities ,  there  is  a  danger  that  those  in  charge  of 
procurement  and  installation  unintentionally  subvert  those  objectives.  One  company  with  which  we  worked 
planned,  at  senior  level,  to  introduce  an  elaborate,  highly  automated  manufacturing  system,  with  a  very 
clear  set  of  strategic  objectives  in  mind  (quality,  rapid  response  to  design  changes,  ability  to  meet 
small  orders  quickly  and  so  on).  Unfortunately,  these  objectives  were  not  fully  appreciated  down  the 
line,  where  a  series  of  decisions  was  taken  about  the  details  of  the  hardware,  software  and  operating 
layout,  which  clearly  reflected  an  emphasis  on  cutting  costs  rather  than  adding  value.  They  ended  up 
with  an  expensive  system  which,  in  practice,  provided  few  benefits  over  that  which  it  replaced;  and 
certainly  was  not  consistent  with  the  strategic  vision  which  had  prevailed  at  the  start  of  the  project. 


Does  the  Organization  encourage  strategic  action? 

Senior  management  has  a  major  role  to  play  in  creating  a  climate  within  the  organisation  that  encoi  rages 
innovation  in  support  of  strategic  objectives.  Staff  at  middle  levels  need  to  be  willing  to  make  the 
effort  and  take  the  risks  associated  with  technical  change.  Acting  as  the  promoter  or  champion  of  a 
major  technical  change,  especially  if  it  needs  to  be  accompanied  by  organisational  change,  puts  the 
person  concerned  in  a  highly  vulnerable  position.  They  need  to  master  not  only  the  technical  aspect,  but 
also  be  willing  to  argue  the  case  through  the  organization,  often  in  the  face  of  indifference  or  outright 
opposition. 

Why  should  they  bother?  In  many  organisations,  rewards  go  to  those  who  play  to  the  rules,  keeping 
existing  practices  running  smoothly,  and  who  don't  stir  things  up.  Such  organizations  are  unlikely  to 
encourage  people  to  take  vulnerable,  risky  positions;  to  that  extent,  they  are  unlikely  to  cross  the 
hurdle  of  establishing  coherent  strategic  objectives  for  their  technological  advances. 


CHANGING  THE  ORGANIZATION 

The  more  scope  there  is  for  computer  technology  to  make  possible  big  changes  in  the  services  offered, 
the  more  likely  it  is  that  there  will  need  to  be  changes  in  the  shape  and  style  of  the  organization. 
Decisions  need  to  be  taken  on  these  matters,  as  well  as  on  those  of  a  technical  nature.  What  are  the 
options,  and  what  experience  is  there  elsewhere? 


Structures  and  Relationships 


Technical  change  makes  it  possible  to  eliminate  some  functions,  alter  the  importance  of  others,  and  to 
create  new  ones  -  it  becomes  possible,  and  perhaps  essential,  to  shake  up  existing  boundaries  of 
responsibility,  and  existing  relationships.  For  example,  advanced  manufacturing  systems  are  greatly 
enhancing  the  role  of  support  facilities  like  planning,  tooling  and  maintenance.  Because  of  the  much 
greater  dependence  of  line  managers  on  these  facilities,  two  companies  we  have  worked  with  are  changing 
structures  and  reporting  relationships  to  make  these  traditionally  separate  support  systems  closer  to, 
and  more  answerable  to,  line  management. 


Technical  change  also  makes  it  possible  to  link  together  successive  stages  of  a  process  more  closely, 
with  data  being  transmitted  automatically  around  the  organization.  There  is  a  strategically  important 
choice  here:  because  a  thing  is  technically  possible  does  not  mean  it  has  to  be  implemented.  For 
example,  a  hospital  laboratory  could  have  linked  its  automated  analytical  equipment  directly  to  a  newly 
installed  computer  system  which  stored  information  on  patients,  processed  the  results,  and  produced  the 
reports  on  individual  samples.  Such  a  link  would  have  made  the  later  stages  of  the  process  somewhat  more 
convenient:  but  it  was  rejected  as  the  analytical  process  would  then  have  been  completely  dependent  on 
the  computer  being  up  and  running  before  work  could  start.  This  was  judged  on  unacceptable  risk,  and  a 
decision  was  taken  to  include  a  deliberate  ’’gap”  in  the  system  which  would  be  filled  by  an  operator, 
rather  than  introduce  a  fully  automated  procedure. 


Management  style 


Choices  also  arise  over  how  the  management  style  of  an  organization  is  changed  as  technology  is 
introduced.  For  example,  computer  technology  can  be  used  either  to  centralize  decision-making,  or  tc 
decentralize  it.  The  temptation  is  often  to  take  the  former  option,  those  in  positions  of  power 
generally  taking  the  view  that  the  more  influence  and  control  they  have,  the  better  the  organization  will 
work. 


The  fallacy  of  this  presumption  is  well  illustrated  by  the  example  of  a  decision  taken  by  the  Board  of  a 
multi-national  company  not  to  increase  the  centre’s  knowledge  of  the  periphery.  They  were  building  a  new 
headquarters  building,  in  which  they  would  incorporate  a  significant  amount  of  office  automation.  All 
the  suppliers  whom  they  invited  to  submit  proposals  try  to  persuade  the  company  to  install  systems  which, 
as  well  as  automating  routine  head  office  functions,  would  enable  them  to  have  direct  computer  links  to 
all  of  their  overseas  subsidiaries.  This  would  have  enabled  head  office  to  receive  much  more  frequent 
reports  on  matters  like  sales,  or  operating  costs  in  their  remote  plants,  thus,  it  was  argued,  improving 
management  control. 


The  Board  decisively  rejected  such  notions.  The  success  of  the  business,  they  felt,  had  been  based  on 
giving  substantial  autonomy  to  local  managers.  They  were  accountable  to  the  company  in  broad  terms,  and 
there  was,  of  course,  close  monitoring  of  their  quarterly  and  annual  figures.  But  within  those 
constraints  they  were  left  to  run  the  business.  Putting  in  direct  computer  links  to  head  office  would 
undermine  that  autonomy,  and  was  therefore  not  implemented  when  the  rest  of  the  office  automation  system 
was  installed. 


Work  Organization 

Having  skilled  committed  people  depends  on  ensuring  that  their  jobs  are  organized  in  a  way  that 
complements  rather  than  replaces  human  skill  and  experience.  Many  of  the  less  successful  applications 
of  computer  technology  have  been  those  where  work  has  been  organized  in  a  way  that  has  led  to  boredom, 
inattention  and  loss  of  interest  amongst  the  staff  concerned.  Such  attitudes  cannot  produce  high 
performance . 

Conversely,  where  decisions  on  work  organization  have  ensured  that  the  skills  and  experience  of  staff 
have  been  complemented  by  the  technology,  the  effects  have  been  highly  beneficial.  In  such  cases,  staff 
have  shown  themselves  able  and  willing  to  accept  responsibility,  to  work  to  overcome  temporary 
difficulties  in  the  system,  to  suggest  valuable  additional  uses,  and  so  on.  By  organizing  work  in  a  way 
that  satisfies  well  established  motivational  requirements  of  individual  jobs,  and  grouping  them  together 
into  natural  work  teams,  companies  have  been  able  to  achieve  far  more  benefits  from  technical  innovation 
than  where  such  factors  have  been  ignored. 


GUIDELINES  FOR  INNOVATION 

In  order  to  overcome  these  hurdles  which  lie  in  the  way  of  successful  innovation,  the  following 
guidelines  are  proposed.  They  are  based  on  successful  practices  in  a  wide  range  of  organizations,  and 
can  serve  as  a  checklist  for  those  charged  with  introducing  technical  change. 


Nine  guidelines  for  exploiting  the  innovative  potential 
of  new  computing  technologies  and  Information  systems 


PURPOSE: 

1.  Ensure  that  the  use  of  new  technology  has  a  clear  strategic  focus  which 

targets  long  term  market  objectives  and  not  just  current  internal  operating  problems. 

2.  Review  and  implement  positive  policies  in  areas  like  employment  practice  and  investment 
appraisal,  so  that  they  encourage  innovation  and  the  strategical  use  ot  new  technology. 

3.  When  evaluating  hardware  and  software  options,  make  sure  that  you  choose  kit  to  fit  the  current 
and  anticipated  needs  of  the  business. 


PEOPLE : 

A.  Review  work  organization  in  relation  to  new  technology,  to  encourage  flexibility,  creativity  and 
skills  development. 

3.  Review  management  styles  and  working  arrangements,  to  ensure  that  these  are  consistent  with  the 
strategic  goals  of  the  business,  and  with  achieving  the  potential  benefits  of  new  technology. 

6.  Design  support  systems  which  are  consistent  with  strategic  aims,  and  which  enable  support 
staff  to  contribute  in  flexible  and  creative  ways  to  the  changing  needs  of  the  organization. 


PROCESS : 


7.  Establish  a  clear  project  management  responsibility  to  guide  the  often  protracted 

implementation  process. 

8.  Plan  the  nature  and  timing  of  user  Involvement,  to  ensure  that  key  staff  establish  "ownership” 
of  new  equipment  and  procedures. 


9.  Develop  a  systematic  training  plan  to  equip  users  at  all  levels  with  the  competence,  in  the 
form  of  new  skills  and  knowledge,  they  need  to  exploit  the  innovative  potential  of  the  new 
technology . 
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Summary 

Standards  for  representing  and  encoding  data  as  well  as  standards  for  the 
interconnection  of  systems  having  different  hardware  and  software  are  both 
essential  for  effective  electronic  information  transfer.  To  date,  although  major 
strides  have  been  made  internationally  on  standards  in  both  the  data  and 
telecommunications  areas,  much  remains  to  he  done  before  effective  electronic 
information  transfer  can  take  place  either  nationally  or  internationally.  This 
paper  addresses  the  difficulties  in  achieving  approved  standards  i nternat i onal ly , 
the  state  of  the  art,  and  some  new  standards  that  potentially  will  have  a  positive 
impact  on  information  transfer  such  as  those  pertaining  to  electronic  publishing. 
It  also  addresses  the  Open  Systems  Interconnection  Reference  Model,  a  suite  of 
telecommunications  standards  which  hold  great  promise  for  the  interconnection  of 
systems  with  different  hardware  and  software.  Finally,  the  paper  discusses  the 
work  that  has  been  done  to  date  to  implement  these  standards  for  library  and 
information  applications  and  challenges  for  the  future. 


Standards  have  been  discussed  either  implicitly  or  explicitly  by  previous  speakers  as  an  important, 
element  in  reducing  harriers  to  information  delivery.  Therefore,  my  paper  is  intended  as  a 
overview  of  the  groups  of  standards  which  are  pertinent  to  electronic  information  transfer  and  the 
current  state  of  these  standards  and  their  implementation. 

Standards  which  are  essential  to  achieving  effective  electronic  information  transfer  fall  into  two 
broad  categories:  technological  standards  and  standards  for  the  encoding  of  the  data  which  is 
stored,  retrieved  and  transmitted  by  the  technology. 

During  the  past  five  years,  innovations  in  technology  have  made  us  increasingly  aware  of  the 
possibilities  and  the  need  for  provision  of  effective  on-line  as  well  as  off-line  transmission  of 
data  to  remote  sites. 

In  1977,  the  Cordite  consul tatif  international  t.el  eqraphi  que  et  telephonique  (CC1TT)  began  work  on 
a  suite  of  international  standards  designed  to  provide,  not  only  interconnection,  but  also 
intercommunication  between  hardware  and  software  of  different  types  and  makes.  This  suite  of 
standards  is  under  the  umbrella  of  the  Open  Systems  Interconnection  '(1SI)  Reference  Model.  Very 
early  in  the  process  the  CCITT  and  the  International  Organization  for  Standard! ration  (ISO) 
jointly  endorsed  the  OS  I  Reference  Model:  the  first  time  in  history  there  had  bepn  such  a  joint 
endorsement  of  a  standard  between  these  two  organizations. 

There  are  several  important,  characteristics  about,  this  suite  of  standards  which  make  it 
potentially  very  powerful.  First,  the  Reference  Model  is  a  7  layer  model  where  each  layer  is 
dependent  on  the  layer  below,  (see  Figure  I).  These  standards  specifications  must  he  implemented 
within  software  on  each  machine  that  needs  to  communicate  with  other  machines.  The  standards  are 
designed  as  communications  standards,  therefore  they  logically  work  between  the  front-end  of  one 
computer  and  another,  and  are  invoked  only  when  computer -to-comput.er  communication  is  required. 

In  other  words  within  your  own  system  you  are  free  to  manipulate  data  as  you  wish,  as  long  as  that 
data  can  be  translated  into  the  standard  data  definitions  and  communications  format  when  required. 
Therefore,  local  autonomy  is  preserved:  an  important  ingredient  to  the  success  of  any  standard. 

Secondly,  the  development  of  the  OS 1  standards  has  from  the  beginning  been  driven  and  supported  by 
the  telecommunications  and  the  hardware  and  software  industry.  This  is  important  to  us  as 
potential  users  of  the  standards  because  it  means  that  software  implementations  of  them  will  he 
available  "off  the  shelf"  from  the  industry.  Moreover,  industry  members  have  created 
organizations  to  ensure  that  implementations  of  these  standards  across  the  industry  can,  in  fact, 
interwork.  The  Corporation  of  ripen  Systems  (CDS)  in  North  America,  the  Standard  and  Promotion  and 
Application  Group  (SPAG)  in  Europe,  and  Promotion  of  Open  Systems  Interconnection  (PDSI)  in  Japan 
have  all  been  created  within  the  past  five  years  in  order  to  achieve  this  goal.  An  international 
umbrella  organization  for  these  regional  orqani zations  is  currently  being  set  up. 

Since  the  OSI  suite  of  standards  is  a  very  complex  and  hierarchical  one,  we,  as  users,  could  not 
afford  the  investment  necessary  to  implement  all  seven  layers  of  them  if  the  substantial 
development  costs  were  not  ahle  to  he  amortized  in  large  part,  by  the  computer  industry. 
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However,  this  is  not  to  say  that  as  potential  users  of  OSI  standards  we  can  simply  sit  back  and 
wait  for  them  to  Be  available.  At  the  application  (7t.h)  layer  the  standards  for  the  software  are 
application  specific  and  therefore  need  to  be  developed  within  specific  user  industries.  These 
application  level  standards  or  protocols  define  the  messages  to  be  sent  between  computers,  the 
format  of  the  messages  and  their  sequence  for  each  application.  We  in  the  information  sector  need 
to  be  concerned  with  the  development  of  OSI  protocols  for  book  order,  for  interlibrary  lending, 
for  document  delivery,  etc.  and  need  to  have  input  ino  the  development  of  more  generic  application 
protocols  such  as  File  Transfer  and  Management  (FTAM',  Information  Search  and  Transfer, 

Distributed  Directories  protocols  etc. 

The  National  Library  of  Canada  has  since  1979,  had  a  research  and  development  team  working  on  the 
development  of  OSI  library  and  information  application  layer  protocols.  To  date,  we  have 
developed  and  implemented  a  file  transfer  protocol  to  transfer  Marc  records  between  an  IBM  and 
CYBER,  GEAC  and  Honeywell  machines  each  running  different  software  and  operating  on  systems  at.  the 
NLC  and  3  Canadian  universities.  This  software  has  been  in  operation  since  November  1984  on  the 
four  machines.  An  interlibrary  loan  messaging  protocol  has  also  been  developed  and  implemented  at 
the  NLC  to  allow  tne  automatic  sending  and  receipt  of  interlihrary  loan  messages  as  well  as  the 
tracking  of  them.  This  application  layer  protocol,  which  is  now  in  the  Canadian,  American  and 
international  standards  process,  has  been  implemented  using  electronic  mail  for  the  lower  six 
layers  of  the  protocol  . 

In  addition  work  is  underway  on  a  hook  order  protocol,  a  cataloguing  application  and  a  series  of 
protocols  for  distributed  directory  systems.  The  National  Library  is  currently  working  with  GEAC 
Computer  Corporation  to  implement  the  protocols  on  the  new  software  they  are  designing. 

During  this  same  period  the  Linked  Systems  Project  (LSP)  in  the  II. S. A.  was  formed  consisting  of 
the  Library  of  Congress,  the  On-line  Computer  Library  Centre  (OCLC)  Inc.,  Research  Libraries  Group 
(RLG)  ,  and  the  Western  Library  Network  (WLN).  They  have  developed  an  Information  Search  and 
Retrieval  protocol  and  a  record  transfer  protocol  which  have  heen  in  operation  since  1986. 

While  there  has  heen  constant  liaison  between  the  work  at  the  NLC  and  that  of  the  LSP,  the 
approach  has  been  somewhat  different. 

In  the  U.S.  they  cho  e  to  build  themselves  layers  four  to  six  of  the  protocols,  relying  on  the 
telecommunications  carriers  for  layers  one  to  three  only,  while  in  Canada  we  chose  to  use  what  was 
available  from  the  telecommunications  industry  for  all  of  layers  1  to  6,  secure  in  the  knowledge 
that  our  national  carriers  were  committed  to  implementing  OSI.  NLC  will  convert  to  the  pure  OSI 
protocols  for  these  lower  layers  in  concert  with  the  telecommunications  companies.  The  other 
major  difference  is  that  all  LSP  protocols  have  been  developed  using  the  connection-oriented  mode, 
while  in  Canada,  with  the  exception  of  file  transfer,  all  the  protocols  have  been  developed  using 
the  store  and  forward  (electronic  mail)  mode  of  connection  (see  Figure  2).  Both  modes  are  OS t 
compatible  and  as  time  goes  on  these  differences  will  be  resolved.  Already  the  agreement,  between 
NLC  and  LC  to  work  toward  common  protocol  standards  for  library  applications  is  beginning  to  hear 
fruit  and  it  is  anticipated  that  snortly  the  Canadian  Interlibrary  Loan  protocol  and  the  U.S. 
Information  Search  and  Retrieval  protocol  will  both  be  North  American  standards. 

On  a  more  general  level,  the  wide  implementations  of  the  DSI  electronic  nail  standard  ( CC ITT 
Recommendation  X.400)  which  is  already  underway  will  lead  to  the  interconnection  of  electronic 
mail  systems  nationally  and  internationally.  This  development,  will  provide  possibilities  for  new 
and  improved  delivery  of  information. 

In  my  opinion,  OS!  will  be  the  communications  technology  of  the  next  decade  for  on-line 
communication,  bringing  with  it  a  very  substantial  reduction  in  the  current  barriers  to 
information  transfer. 

However,  it  is  essential  that  we  understand  that  the  ability  to  connect  systems  alone  will  not 
achieve  useful  results  -  in  addition,  the  data  content  and  format  transmitted  must  be  in  standard 
form  for  intercommunication  to  take  place.  In  this  regard,  the  library  and  information  community 
are  fortunate  in  having  already  achieved  a  high  degree  of  concensus  on  data  and  format  standards 
internationally.  This  achievement  positions  the  community  well  to  take  advantage  of  OSI 
technology.  Sectors  which  do  not  yet  have  widely  recognized  standards  for  data  will  find  their 
ability  to  take  advantage  of  this  technology  for  their  applications  circumscribed. 

In  the  last  fifteen  years  we  have  achieved,  and  have  widely  implemented,  several  standards  for 
encoding  bibliographic  data  in  machi ne-readabl e  form.  Spearheaded  by  the  library  profession  there 
are  the  International  Standard  Bibliographic  Descriptions  (ISRD's)  for  nearly  all  types  of 
materials  and  these  are  used  globally.  These  de  facto  standards  prescribe  the  recording  of  title 
page  information  but  not  how  to  provide  access  points  to  a  bibliographic  record.  They  also 
differentiate  the  data  elements  in  a  record  by  the  use  of  punctuation  symbols  which  are 
part i cul ari ly  useful  for  display  of  data  recorded  in  foreign  languages. 

Further,  the  International  Standard  Book  Number  (ISBN)  and  the  International  Standard  Serial 
Number  (ISSN)  are  both  ISO  standards  and  are  widely  implemented  and  used  by  libraries,  publishers 
and  distributors,  and  by  the  abstracting  and  indexing  community.  In  the  H.S.A.,  the  ISSN  is  now 
required  for  registering  serials  t.o  qualify  for  reduced  postal  rates,  and  the  ISSN  will  also  be 
used  on  bar  codes  and  article  number  identification  within  serials  in  the  USA,  UK  and  elsewhere. 
The  ISRN  is  already  widely  in  use  in  barcoding  schemes. 

The  MARC  format  for  machine-readable  encoding,  while  imperfect,  has  been  widely  accepted  as  the 
communi cat i ons  format,  for  the  tape  exchange  of  bibliographic  data  amongst  libraries.  It  is  based 
on  an  international  standard-ISO  2709. 


Designed  originally  by  the  Library  of  Congress,  other  national  versions  of  this  format  emerged: 
CANMARC,  UKMARC. ,  INTERMARC  (France),  AUSMARC  and  MABEIN  (West  Germany)  are  examples.  Recently 
there  has  been  agreement  international  ly  on  a  superse"'  format,  UNIMARC,  which  is  increasingly 
being  used  for  international  tape  exchange. 

The  abstracting  and  indexing  community  commonly  use  the  UNI S I  ST  Reference  Manual  for  recording 
contents  of  articles  in  serials,  and  the  development  by  UNESCO  of  the  Common  Command  Format  (CCF) 
has  provided  a  communications  format  for  this  data. 

While  these  standards  greatly  facilitate  a  reasonably  effective  method  of  communicating 
bibliographic  data  nationally  and  internationally  for  machine-readable  tape  transfer,  there  are 
still  problems.  The  UN  I S I  ST  Reference  Manual  and  the  CCF  are  not  totally  compatible  with  UNIMARC, 
and  perhaps  never  will  be  because  their  underlying  purpose  is  different.  Although  the  ISBN, 
because  of  its  simple  structure  (one  edition  of  a  book,  one  number),  has  been  able  to  be 
implemented  simply  by  assigning  blocks  of  unigue  numbers  to  publishers  who  then  assign  them  to 
their  publications,  the  ISSN  has  been  more  difficult  to  manage.  Because  serials,  by  their  nature, 
change  title,  merge,  split,  etc.  the  ISSN  must  be  hibl iographical ly  controlled  by  national  and 
ultimately  an  international  agency  who  maintain  an  identification  record  for  serials.  This  is  the 
International  Serials  Data  System,  a  UNESCO  program  hosted  by  the  French  government  with  currently 
4S  national  centres  throughout  the  world  feeding  the  International  Centre  information  made 
accessible  via  the  ISDS  Register  of  Seria'  ,'jhl i cati ons .  While  this  register  of,  currently, 
370,000  records  is  a  tribute  to  international  cooperation,  problems  exist  mainly  due  to  delays  in 
assigning  and  making  accessible  the  assignment  of  ISSN's  and  key-titles.  Work  is  underway  to 
alleviate  these  problems,  but  until  timeliness  is  accomplished,  the  use  of  ISSN  as  a  global 
identification  number  for  serials  will  not  he  as  widespread  as  is  needed. 

In  spite  of,  or  because  of,  the  increased  activity  in  international  data  definitions  and  format 
standards  in  several  sectors  of  the  information  industry  coupled  with  the  OS l  standards  activity, 
there  will  be  an  .ncreased  need  to  convert  data  from  one  standard  to  another  especially  when 
communicati ng  across  various  sectors  of  the  information  industry.  Current  methods  of  format 
conversion  are  time-consuming  and  slow  and  must  be  improved  if  we  are  to  be  able  to  exploit  the 
potential  of  on-line  data  transfer. 

Hence  work  has  been  done  in  the  government,  of  Canada  to  use  an  artificial  intelligence  technique 
called  inferencing  to  write  software  to  improve  the  format  conversion  process.  The  prototype  has 
been  successfully  developed  hut  more  work  is  needed  before  such  developments  can  aid  the 
conversion  of  machine  readable  formats. 

Equally  the  need  for  a  means  of  standardizing  search  arguments  and  keys  for  searching  different 
data  bases  has  long  been  recognized  as  a  desirable  tool  for  easier  use  of  multiple  data  bases. 
Various  attempts  such  as  the  creation  of  a  common  command  language  (CCL)  or  mediation  software, 
have  met  with  some  resistance  from  information  providers  who  felt  its  implementation  would  reduce 
the  specificity  and  clarity  of  their  search  strategy.  Recently,  a  Standard  Query  Language  (SOL) 
has  been  approved  international ly  which  allows  sophisticated  search  arguments  to  be  programmed, 
but  is  not  very  "user-friendly".  SQL  and  CCL  may  he  used  in  conjunction  with  each  other,  CCL 
being  the  display  mechanism. 

In  addition  to  format  converter  software  being  potentially  a  very  useful  tool  for  conversion  of 
record  formats,  it  is  possible  that,  in  future  such  software  could  also  be  employed  to  convert  a 
search  query  into  formats  appropriate  for  searching  data  bases  employing  differing  search 
strategies  and  thus  largely  obviate  the  need  for  techniques  such  as  a  common  command  language.  If 
this  supposition  proves  correct  and  the  process  proves  economical,  the  impact  of  such  software  on 
improved  service  could  be  substantial.  At  present,  it  is  not  possible  to  predict  whether  either 
or  both  of  the  above  methods  will  prove  efficient  or  effective. 

Further,  major  strides  are  being  made  in  the  field  of  data  definition  for  electronic  publishing. 
For  the  sake  of  discussion  it  is  useful  to  divide  electronic  publishing  into  two  processes, 
electronic  data  creation,  and  electronic  dissemination.  Greater  progress  has  been  made  to  date  in 
the  former  than  the  later. 

The  acceptance  by  ISO  of  the  Standardized  General  Markup  Language  (SGML)  in  1986  has  provided  a 
standard  for  encoding  data  for  word  processing  packages,  thereby  allowing  discs  to  be  read  on  more 
than  one  make  of  word  processing  software  during  the  editing  and  publishing  process.  Further  the 
Association  of  American  Publishers  ( AAP }  in  the  U.S.  has  developed  a  standard  for  electronic 
manuscripts  which  is  an  application  of  the  SGML  standard  and  which  will  provide  a  common  means  of 
describing  front  matter  and  back  matter,  (i.e.  title  page  table  of  cortents,  and  reference 
information)  for  an  electronic  manuscript.  This  standard,  which  will  he  submitted  to  ISO  shortly, 
will  be  of  great  importance  to  those  of  us  concerned  with  control  of  and  access  to  publishing  in 
electronic  form  because  it  will  provide  a  standard  means  of  providing  bibliographic  control  for 
electronic  publications.  It  is  called  the  Electronic  Manuscript  Preparation  and  Markup  standard. 
It  would  appear  that  CDROM  may  be  a  more  widespread  method  of  publishing  electronically  than 
on-line  access,  although  both  will  undoubtedly  exist. 

CDROM  and  optical  disc  technology  is  a  technological  development  which  is  complementary  to  the  OS  I 
developments.  Optical  disc  technology  has  great  potential  for  the  dissemination  of  large 
quantities  of  information  without  the  need  to  rely  on  tel ecommunications  services,  and  with  the 
associated  cost  savings,  parti cul ari 1 y  internationally.  While  agreement  has  been  reached  on 
standards  for  imprinting  the  discs,  such  agreement  is  still  lacking  for  the  software  to  retrieve 
information  from  the  disc.  Until  a  retrieval  standard  is  in  wide  use,  the  ability  to  search  discs 
from  multiple  vendors  will  he  cumbersome  and  costly,  and  will  detract  from  the  use  of  optical  disc 
technology  for  information  transfer. 
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In  summary,  increased  activity  in  both  technological  and  data  standards  development  world-wide 
mirrors  a  recognition  that  without  both  data  and  telecommunications  standards  it  will  be 
impossible  to  develop  effective  and  efficient  mechanisms  for  the  interchange  of  data.  The  barrier 
of  lack  of  standards  to  effect  information  exchange  is  being  slowly  overcome  by  the  dedicated  work 
of  experts  from  many  countries.  It  is  incumbent  upon  us,  the  users,  to  be  involved  in  this 
process  to  ensure  the  standards  meet  our  needs  and  also  to  implement  them  faithfully  in  order  to 
provide  our  clientele  with  ready  and  timely  access  to  the  information  they  require,  regardless  of 
the  data  source. 


FIGURE  1 
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TWO  MOOES  OF  INTERCONNECTION 


OSI  module 


OSI  module 


a)  Connection-oriented  -  single  session  per  transaction. 


OSI  module  MHS  MHS 


OSI  module 


b)  Connectionless  or  message-oriented  -  separate  sessions  to  complete  a  transaction. 
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NOTES  ON  THE  TRANSLATIONS 


The  Information  and  Documentation  Department  of  Aerospatiale  at  Suresnes  near  Paris  is  currently  experimenting  with 
machine  translation,  using  SYSTRAN.  Since  one  of  the  main  barriers  considered  during  this  meeting  was  the  barrier  of 
language,  they  kindly  offered  to  translate  into  English  the  two  papers  that  were  presented  in  French,  and  into  French  one  of  the 
English  language  papers.  For  the  latter,  we  selected  the  paper  by  Professor  Zimmerman  (No.4),  because  it  was  concerned,  to  a 
large  extent,  with  machine  translation.  AGARD  is  extremely  grateful  to  Aerospatiale  for  the  time  and  trouble  they  have  taken 
over  this  task. 

This  section  of  the  Conference  Proceedings  contains  comments  by  Aerospatiale  (in  French  and  in  English)  and  examples 
of  the  raw,  unedited,  output  from  SYSTRAN,  ft  must  be  stressed  firstly  that  SYSTRAN  is  only  one  machine  translation  system 
—  there  are  a  number  of  others  —  and  secondly  that  the  examples  given  were  translated  without  the  making  of  a  special 
dictionary  for  the  subject  matter.  When  reading  the  translations  it  must  also  be  borne  in  mind  that  the  raw'  translation  output  by 
the  computer  is  intended  merely  to  give  an  end-user  (scientist,  engineer,  manager,  etc.)  a  translation  that  is  sufficiently  good  for 
him  to  grasp  the  meaning  of  the  original  document. 

The  times  shown  in  the  table  on  page  T-3  give  an  indication  only  of  the  results  of  a  single  experiment.  They  should  not  be 
considered  as  being  generally  applicable  to  all  translations.  The  comparison  figures  given  in  the  last  column  of  the  table  are 
estimates  of  how  long  a  professional  translator  might  take  to  produce  a  raw  quality  translation  from  the  same  original  text. 


COMMENT  AIRES  SCR  LES  TRADUCTIONS  PRESENTEES 


La  Division  d'fnformation  et  de  Documentation  d' AEROSPATIALE  a  Suresnes  precede  actuellement  a  des  experiences 
de  traduction  machine  a  l'aide  du  systeme  SYSTRAN.  Etant  donne  que  1‘une  des  principales  barrieres  a  la  communication 
etudiee  tors  de  la  Conference  a  ete  la  barriere  de  la  langue,  AEROSPATIALE  a  propose  de  faire  traduirc  vers  l'anglais  les  deux 
communications  presentees  en  frangais  et  de  faire  traduire  I'une  des  communications  presentee  en  anglais  vers  le  frangais.  a 
l'aide  du  systeme  SYSTRAN.  En  ce  qui  concerne  cette  derniere  proposition  nous  avons  choisi  la  communication  du  Professeur 
ZIMMERMAN  (No.4)  puisqu’elle  concerne  principalement  la  traduction  machine.  L'AGARD  est  tres  reconnaissant  envers 
1'AEROSPATIALE  du  temps  passe  a  1'accomplissement  de  cette  tache.  et  de  la  peine  qu  elle  s'est  donnee  pour  la  mener  a  bien. 

Cette  partie  du  compte-rendu  de  conference  comporte  des  commentaires  emis  par  AEROSPATIALE  (en  frangais  et  en 
anglais)  et  quelques  exemples  de  traductions  SYSTRAN  brutes  et  sans  post-edition.  II  faut  d'abord  souligner  que  SYSTRAN 
n'est  qu’un  systeme  de  traduction  machine  parmi  d'autres  et  que  les  exemples  donnes  ici  ont  ete  traduits  sans  la  compilation 
prealable  d'un  dictionnaire  specialise.  Lors  de  la  lecture  des  traductions  il  doit  etre  tenu  compte  du  fait  que  la  traduction 
"brute"  de  Tordinateur  est  destine  a  un  utilisateur  (scientifique.  ingenieur.  gerant  etc.)  qui  n'a  besoin  que  de  comprendre  le  sens 
general  du  document  original. 

Les  temps  passes  a  faire  les  traductions,  indiques  au  tableau  ci-apres.  ne  representent  que  les  resultats  d  une  seule 
experimentation.  Ils  ne  sont  point  applicable  a  toutes  les  traductions.  Les  chiffres  indiques  a  la  derniere  colonne  representent 
des  estimations  du  temps  que  passerait  un  traducteur  pvofessionnel  a  faire  une  traduction  "brute"  de  qualite  egale  a  partir  du 
meme  texte  original. 
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COMMENTAIRES  GENERALIX 
par 

Oleg  Lavroff 
Societe  Aerospatiale 

Chef  du  Departement  Information  —  Documentation 
B.P.No.76 

92152  Suresnes  Cedex 
France 

L'experimentation  menee  a  I'AEROSPATI  ALE  a  pour  but  de  pouvoir  diffuser  a  terme  le  plus  rapidement  possible  et  a 
moindre  cout  la  traduction  de  textes  presentant  un  interet  immediat  et  contenant  des  informations  necessaires  a  la  bonne 
marche  de  ['enterprise. 

L'experimentation  est  menee  avec  le  systeme  SYSTRAN.  Les  textes  joints  montrent  le  niveau  actuel  de  developpement 
de  ce  systeme  et  mettent  en  evidence  les  points  particuliers  qui  doivent  etre  developpes  (elargissement  des  dictionnaires  actuels 
utilises  par  SYSTRAN)  et  ameliores  (amelioration  du  systeme  a  partir  des  analyses  effectuees  par  les  traducteurs). 

Au  stade  actuel,  les  traductions  brutes  ne  sont  pas  suffisamment  comprehensibles  et  elles  necessitent  de  ce  fait  une  post¬ 
edition  (verification  et  mise  en  forme  minimale  correcte  des  phrases)  de  I'ordre  de  15  minutes  par  page  pour  atteindre  un 
niveau  suffisamment  comprehensible.  Ce  temps  devrait  etre  reduite  de  moitie  une  fois  resolu  le  probleme  de  la  terminologie 
(codification  des  termes)  et  des  principaux  defauts  constates  dans  le  systeme,  comme  indiques  ci-dessous. 

A  ce  jour,  la  traduction  issue  de  la  post-edition  correspond  au  niveau  souhaite  de  traduction  brute  fournie  par  la  machine. 
Cette  post-edition  est  assuree  par  des  traducteurs  qui  participent  volontairement  a  l'experimentation  de  la  TAO  (Traduction 
Assistee  par  Ordinateur)  a  I'AEROSPATI  ALE. 

Le  role  actuel  du  traducteur  (correcteur)  consiste: 

( 1 )  a  verifier  la  traduction  brute  machine  et  a  la  rendre  suffisamment  comprehensible. 

(2)  a  analyser  les  textes  bruts  traduits  (signaler  le  manque  de  terminologie  et  les  erreurs  de  traduction  dues  a  la  maehine). 

A  terme,  une  fois  le  systeme  mis  au  point,  on  sera  amene  it  considerer  deux  types  de  traducteurs: 

( 1 )  un  correcteur  post-editeur  (verification  et  correction  des  traductions  brutes  machines),  qui  aura  pour  mission  de 
verifier  la  traduction  de  textes  devant  etre  diffuses  tres  rapidement  et  pour  lesquels  une  traduction  de  qualite 
humaine  n'est  pas  necessaire  (suivi  de  la  veille  technologique.  la  concurrence  etc....). 

(2)  un  traducteur  qualifie  reserve  a  des  traductions  presentant  des  difficultes.  que  seul  un  traducteur  humain  pent 
resoudre. 

Le  tableau  ci-apres  indique  les  temps  passes  pour  la  saisie  des  texte...  lorsqu'ils  ne  sont  pas  disponibles  sur  support 
informatique  (reconnaissance  des  caracteres  plus  corrections),  le  temps  de  post-edition  et  le  temps  global  de  la  traduction  (hors 
transfer!  et  temps  machine  qui  sont  difficilement  chiffrabies  et  peu  significatifs). 

Pour  information  on  indique  egalement  le  temps  que  passerait  un  traducteur  humain  (traduction  de  qualite  humaine 
brute)  afin  de  pouvoir  estimer  a  quel  temps  se  situe  une  traduction  brute  machine  suffisamment  comprehensible. 

Dans  I  hypothese  la  plus  favorable  a  I'avenir  (sans  post-edition,  qualite  brute  comprehensible): 

—  temps  de  traduction  par  la  machine  serait  d  environ  1 5— 20°-.,  par  rapport  a  une  traduction  qualite  humaine  brute. 
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GENERAL  COMMENTS 


The  purpose  of  the  experiment  being  carried  out  at  AEROSPATIALE  is  to  achieve  the  fastest  and  cheapest  possible 
translation  of  documents  of  immediate  interest,  containing  information  of  a  type  which  is  necessary  tor  good  company 
management. 

The  experiment  is  being  carried  out  using  the  SYSTRAN  system.  The  attached  documents  show  the  present  level  of 
development  of  this  system  and  highlight  those  points  which  require  further  attention  (expansion  of  the  current  SYSTRAN 
system  dictionaries)  and  improvement  (refinement  of  the  system  by  means  of  analyses  made  bv  the  translators). 

The  raw  translations  currently  being  produced  are  not  sufficiently  understandable  and  as  a  result,  require  post  editing 
(checking  and  minimal  sentence  restructuring),  in  order  to  reach  an  acceptably  intelligible  level,  which  requires  something  of 
the  order  of  1 5  min  per  page.  This  time  should  be  halved  once  the  problem  of  terminology  (coding  of  terms )  has  been  overcome 
and  'he  main  faults  found  with  the  system,  as  indicated  in  the  following  pages,  have  been  rectified. 

At  the  present  time,  the  translations  produced  after  post-editing  represent  the  quality  level  aimed  at  for  future  raw 
machine  translation.  This  post-editing  is  carried  out  by  translators  who  have  chosen  to  take  part  in  the  CAT  (Computer 
Assisted  Translation)  experiment  at  AEROSPATIALE. 

The  translator's  (corrector's)  present  role  is  to: 

(2)  Check  the  raw  machine  translation  and  make  it  sufficiently  understandable 

(2)  Analyse  the  raw  translations  (pointing  out  any  gaps  in  terminology  and  the  errors  in  translation  made  by  the 
machine) 

Eventually,  once  the  system  has  been  perfected,  we  shall  be  able  to  talk  about  two  types  of  translator: 

( 1 )  A  corrector  post  editor  (checking  and  correction  of  raw  machine  translations),  who  will  check  documents  w  hich 
need  to  be  issued  rapidly  and  for  which  human  translator  quality  is  not  required  (monitoring  of  state  of  the  art  in  the 
relevant  field,  activity  of  competitors  etc.) 

(2)  A  qualified  translator  reserved  for  translations  containing  difficulties  which  only  a  human  translator  can  resolve. 

The  table  below  shows  the  time  spent  on  document  input,  these  texts  not  being  available  on  data  processing  media  (ic  time 
for  character  recognition  plus  corrections),  the  post-editing  time,  and  the  global  time  taken  for  the  translation  (exclusive  of 
transfer  times  and  machine  times  which  are  difficult  to  cost  and  are  not  very  significant. 

For  comparison  we  also  show  the  time  which  would  be  spent  by  a  human  translator  to  produce  the  same  quality  of 
translation. 

Assuming  the  most  favourable  conditions  in  the  future  (no  post  editing,  comprehensible  raw  machine  translation),  the 
translation  time  by  computer  would  be  about  15—20%  of  the  time  for  human  raw  quality  translation. 
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EXTRACT  FROM  UNEDITED  MACHINE  TRANSLATION  OF  PAPER  2. 

Since  the  appearance  of  the  language,  the  inhabitants  of  the  sphere  communicated  always  between  them  by  local  'anguages.  or 
even  of  the  dialects,  which  were  really  comprehensible  only  by  a  relatively  low  number  of  persons  setting  up  a  cul  •  ural  regional 
group. 

The  twentieth  century  saw  the  conjugation  of  several  phenomena:  explosion  of  demography,  lengthening  of  the  life  expectancy, 
increase  in  the  trade  and  development  of  technology,  which  led  the  five  thousand  million  men  living  on  ground  in  1 987,  to  be 
communicated  between  them  on  several  levels. 

At  the  local  level,  language  —  or  dialect  —  is  always  the  communication  tool  of  the  cultural  group  of  an  area,  with  meanwhile  of 
the  new  characteristics:  the  number  of  inhabitants  of  certain  areas  increased  much  during  the  twentieth  century  (Algeria,  Egypt, 
Indonesia,  Pakistan...)  and  the  frequent  bijectivity  between  cultural  language  and  a  state  led  to  a  strengthening  of  membership  of 
this  cultural  group. 

At  the  overall  level,  trade  in  any  kind  which  develops  since  a  few  tens  years  on  the  whole  of  the  world  was,  in  point  of  fact, 
precedeed  during  two  or  three  important  centuries  of  trade  but  limited  to  what  one  could  call  them  “large  areas”  sphere.  And  as 
it  well  necessary  was  to  comprehend  oneself  to  trade  or  beat  oneself,  one  notes  that  today  eleven  languages  are,  in  fact,  become 
"vehicular"  for  one  of  these  areas: 

Mandarin  China 

Indi  India 

Englishmen  Northern  America  and  Western  Europe 

Spaniard  America  south  and  centre 

Arab  Not  them  Africa  and  the  Middle  East 

Malais  Asia  of  the  south-east 

Russian  Eastern  Europe 

Frenchmen  Western  Europe  and  Africa 

Swahili  East  Africa 

Japanese  Japan 

Portuguese  South  America 

1.2  Tendency 

The  whole  of  the  major  demographic,  socio-economic  and  military  balances  was  always  determining  for  the  importance  of 
language,  and  so  is  it  difficult  to  forecast  of  the  future.  Meanwhile,  if.  for  the  next  century,  the  assumption  of  major  cataclysm  is 
excluded  (biological,  military,  geological  or  cosmic),  the  demographic  factor  becomes  leading  and  done  to  appear  piofound 
changes  in  the  importance  of  the  major  languages  of  the  sphere. 

Then  the  most  outstanding  facts  for  the  fifty  years  to  come  are: 

—  the  stabilization  of  certain  languages:  Mandarin,  indi.  French.  Russian.  Japanese, 

—  the  growth  of  swahili,  malais  and  Portuguese, 

—  the  rapid  growth  of  Spanish  and  the  even  more  rapid  growth  of  Arabic  who  could  exceed  Mandarin  in  the  middle  of  the 
next  century. 

To  some  extent  share  of  currently  very  important  languages  in  number,  like  Mandarin,  the  indi  and  English  will  decrease  to  the 
profit  primarily  of  Spanish  and  especially  of  Arabic. 

In  addition,  political  strategic  choices,  like  the  decision,  in  1928.  of  Mustapha  Kemal  ATATURK  to  impose  the  Roman 
alphabet  in  Turkey  and  the  decision  to  rock  on  the  Cyrillic  alphabet  for  esquimaux  of  U.  r.  S.  S,  are  at  the  same  time  particularly 
important  in  the  plan  of  exemplarity  and  of  low  importance  at  the  quantitative  level  of  the  evolution  of  languages  for  the  w  hole 
of  the  sphere. 

2  THE  USE  OF  LANGUAGES  IN  1 98  7  IN  THE  FIELD  OF  SCIENCES  AND  TECHNIQUES. 

2. 1  Need  for  the  international  scientific  paper: 

For  a  country,  the  broad  cultural  aims  of  scientific  and  technical  information  were  synthetized  in  a  circular  (6)  of  the  Minister 
for  French  research  in  September  1981: 

—  to  express  the  scientifico-culturelle  identity  of  people. 

—  to  ensure  the  access  of  this  people  to  modernity,  in  its  own  language, 

—  to  ensure  democratically  the  equality  of  the  access  to  knowledge  by  the  passage  of  the  scientific  and  technical  information 
on  the  whole  of  the  social  fabric  of  a  nation. 

i  ms  third  point  not  being  only  the  recovery  of  a  concept  of  philosophical  ALAIN  which  developed  ‘he  idea  that  “culture  level 
of  a  country  was  not  defined  by  that  of  Plato  but  well  by  that  of  its  being  useful". 
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For  a  research  organism,  this  international  communication  is  also  necessary  for  the  defence  of  the  quality  of  the  organism  (its 
image)  or  more  simply  for  the  justification  of  the  use  of  the  allocated  funds.  We  know  all  that  the  major  international  periodic 
congresses  are  in  fact  the  occasion  given  to  the  research  organisms  to  give  an  account  of  their  work  in  front  of  public  opinion. 
At  the  level  of  the  individual  finally,  the  publication  remains  always  necessary,  despite  the  reality  of  the  "invisible  college”  made 
up  by  the  persons  who  are  concerned  with  a  same  research  orientation,  for  the  anteriority  problem  arises  often,  as,  quite  simply, 
the  consolidation  of  the  personal  the  research  worker's  prestige;  already  several  tens  years  ago  only  the  dilemma  publish  gold 
perish  is  well  anchored  in  the  head  of  all  the  scientists. 

In  any  event,  it  is  well  in  science  and  in  technology  that  intellectual,  technological  and  financial  investments  are  the  largest  ones 
and  therefore,  overall,  it  is  well  there  that  the  need  for  best  information  is  the  most  pressing  one.  In  addition,  a  report  of  the  D. 
O.  E  .  (7)  on  the  impact  of  scientific  and  technical  information  show  that  investment  in  I  S.  T.  is  particularly  profitable  for 
research. 

2.2  Type  of  the  communication:  oral,  written: 

All  the  analyses  (1,  2,  3,  4,  5.  6)  reveal  that  English  is,  in  fact,  the  most  practical  common  language  for  the  majority  of  the 
scientists. 

It  is  currently  the  most  widespread  language  quantitatively  among  the  scientists  and  technicians  of  the  whole  world,  and  it  is, 
additionally,  located  in  the  group  of  languages  which,  in  the  field  of  the  quality  of  the  publications  are  the  most  interesting  ones. 

English  being  therefore  currently  the  common  language  of  the  scientific  paper,  this  brings  a  fundamental  difference  between 
the  English  language  scientist  and  the  others.  With  regard  to  the  oral  communication  (?,  4),  and  particularly  during  the 
meetings,  congt esses  or  conferences,  anglophones  have  an  enormous  natural  advantage  which  makes  some  non  anglophone 
large  losers  of  the  communication  on  all  the  levels.  At  the  level  of  of  course  the  message,  but  more  especially  with  that  of  the 
emotional  relation:  the  loss,  would  say  MAC  LUHAN,  is  double,  at  the  same  time  on  the  "message"  and  on  the  “massage". 

This  phenomenon  is  still  amplified  by  the  fact  that  many  English  or  American  scientists  comprise  only  English  and  that  they 
impose  to  some  extent,  to  the  whole  of  a  group,  the  choice  of  the  English  language  where  they  are  of  course  the  more  at  ease.  We 
have  all  in  memory  situations  where  a  group  discussing  in  German.  French  or  Spanish  rocks  on  English  on  arrival  of  only  one 
person,  to  the  major  disadvantage  of  all  the  autres*. 

With  regard  to  the  written  communication  (3,  4).  the  situation  are  reversed  and  turn  to  the  advantage  of  non  anglophone. 

Indeed  (3),  12  %  of  anglophones  can  read  French  and  4  %  German,  whereas  97  %  of  the  French-speaking  persons  and  of 
German-speaking  can  read  English.  This  enables  therefore  the  nen  anglophone  research  workers  to  have  access  to  the 
anglophone  literature  which  represents  30  %  production  of  the  publications. 

This  result  is  corroborated  by  the  analysis  of  "science  index  quotation"  which  shows  that  the  anglophone  authors  quote  with  90 
%  of  the  anglophone  documents  (whereas  the  production  is  of  50  %)  and  the  possible  difference  in  quality  cannot  explain  a  so 
major  difference,  more  especially  as  two  phenomena  come  into  play  in  opposite  direction:  the  fact  that  much  the  major 
international  reviews  are  in  English  language  is  compensated  by  the  fact  that  the  documents  bought  by  the  libraries  in  language 
other  than  English  much  better  is  selected  (owing  to  the  difficulty  of  access)  therefore,  in  doubt,  better  quality,  and  therefore 
more  important  to  read. 

A  rather  remarkable  point  of  the  written  communication  which  does  not  appear  me  to  have  been  pointed  out  enough  is  that 
certain  specialists  in  the  reading  committee  of  a  major  international  review  are  informed  of  the  contents  of  Article  six  month 
before  the  other  readers.  All  occurs  as  if  information  went  to  the  best  informed. 

2.3  Type  of  contents: 

Inside  sciences  and  techniques,  the  linguistic  situation  are  not  the  same  one  of  a  field  to  the  other. 

If  we  take  the  example  of  the  French  scientists  (3),  we  note: 

—  that  in  the  fields  of  mathematics,  of  physics  and  of  chemistry,  more  than  half  of  French  author  articles  are  written  in 
English  and  that  this  percentage  made  only  grow  for  the  last  years; 

—  that  in  the  fields  of  the  “sciences  applied"  (biology,  sciences  of  the  ground,  medicine  and  technology),  article  to  three  is 
written  in  English,  and  that  the  growth  of  the  number  of  the  publications  in  English  language  seems  to  have  stopped. 

In  addition,  we  note  that  the  share  of  Japanese  language  grows  quickly,  particularly  in  technology.  A  report  of  1986  (8) 
considers  at  600  000  the  number  of  scientifico-technologiques  publications  a  year,  of  which  less  than  3  000  arc  the  subject  of  a 
translation,  even  partial. 

These  two  examples  permit  the  following  official  report:  the  publications  in  fundamental  sciences  are  directed  towards  writing 
in  English ,  those  of  applied  sciences  are  primarily  written  in  national  language. 

*  I  remember  a  negotiation  meeting  in  Brussels  assembling  9  persons  of  6  countries:  8  originating  in  5  countries  speaking  usually  French 
and  American  speaking  only  English. 
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Considerant  le  developpement  technique  dans  1'industrie  de  ('information,  on  doit  realiser  que  I'ordinateur  sera  integre  dans 
presque  chaque  activite  humaine  professionnelle.  Dans  le  domaine  du  mot  ou  du  traitement  de  textes.  ('interaction  homme- 
machine,  sur  le  niveau  de  PC  ou  sur  la  base  d  un  postc  de  travail  sur  le  terminal  au  lieu  d'  une  machine  a  ecrire.  sera  la  norme 
dans  un  proehe  avenir,  qui  sigmfie  de  la  fin  des  annees  '80. 

De  I'autre  cote,  le  problems’  de  la  traduction  automatique  de  texte  naturel  en  general  —  meme  pour  ne  pas  parler  sur  la 
comprehension  de  la  parole  et  la  traduction  de  la  parole  —  ne  sera  pas  resolu  dans  le  sens  de  HQFT  (la  traduction  automatique 
entierement  automatique  de  haute  qualite).  en  raison  de  la  complexity  des  langages  naturels.  Done  il  y  a  des  limites  (plus  ou 
moins  dans  l'approche  scientifique)  et  des  possibilites  des  outils  techniques  d’integration  en  cours  de  (I'humain)  traduction  ou 
meme  texte  comprenant  par  les  appareils. 

Le  but  de  cette  contribution  est  d'explorer  les  possibilites  de  I"  utilisation  de  I'ordinateur  dans  le  domaine  de  lappareil  et  de  la 
traduction  automatisce.  Meme  si  il  pourrait  etre  d'une  certaine  attraction  traiter  egalement  de  principaux  problemes  de  la 
translatabilite  les  textes  (et  de  la  comprehension),  nous  nous  concentrerons  sur  la  traduction  technique,  qui  signifie  la 
traduction  de  textes  techniques  ou  communs. 

Aspects  systematiques 

Meme  si  il  pourrait  etre  d'un  certain  interet  comment  la  traduction  automatique  est  realistic,  le  jeu  linguistique  d'aspects 
(particulierement  les  modeles  de  la  grammaire  et  des  strategies  possibles  ou  employees)  une  partie  subordonnee.  Tellement  on 
peut  supposer  qu'il  v  a  line  sorte  de  "bcitc  noire",  oil  un  texte  ou  des  mots  dans  un  langage  naturel  est  mis  dedans  et,  avec  ou 
sans  1'  interaction  humaine  un  ou  les  traductions  alternatives  sortent  du  systeme.  La  traduction  pourrait  etre  bonne,  utile  ou 
mauvaise  sous  les  aspects  de  I'utilisateur. 

Se  concentrant  sur  ('utilisation  et  1'utilite  de  la  traduction  automatique  ou  de  la  traduction  assistee  par  ordinatcur.  on  doit  —  cn 
principe  —  distinguer  entre  deux  groupes  d'utilisateurs  principaux: 

( 1 )  le  soi-disant  utilisateur  final,  g  de  E„  un  essai  habile  d'obtenir  les  informations  d'un  article  ecrit  dans  un  langage  naturel 
plus  ou  moins  inconnu  a  lui;  une  personne  ecrivant  une  lettre  a  un  ami  en  langue  etrangere...  et 

(2)  un  agent  professionnel,  particulierement  un  traducteur  humain  qui  essave  d'utiliser  I'appareil  comme  outil  en  cours 
d'accomplir  son  travail. 

Sous  ces  aspects,  on  peut  laisser  de  cote  des  projets  et  de  base  au  sujet  de  la  recherche  scion  les  principcs  traduction 
automatique  comprehension  de  la  langue  et  orienter  sur  les  outils  pratiques  resp.  svstemes. 

Comme  point  de  depart,  il  est  important  de  distinguer  entre  deux  strategies  principales:  traduction  automatique  (MT)  et 
traduction  assistee  par  ordinatcur  (CHAT). 

—  Un  systeme  s'appellera  un  systeme  de  traduction  automatique  (MT)  seulement  si  un  processus  de  traduction  —  le 
demurrage  du  texte  source  iisibie  sur  ordinatcur  est  accompli  sans  aucune  interaction  humaine  pour  atteindre  une  qualite  de 
texte  cible  au  moins  “assez  bonne"  pour  ('information 

—  Un  systeme  s'appellera  un  systeme  de  traduction  assistee  par  ordinatcur  (le  CHAT)  si  une  interaction  humaine  est 
necessaire  ou  prevue  pour  atteindre  le  but  d'une  “bonne"  traduction  d'un  texte  source  (Iisibie  sur  ordinatcur  ou  pas). 

II  est  bien  evident  qu'un  systeme  MT  peut  etre  employe,  en  plus,  comme  composant  dun  CHAT :  un  texte  peut  etre  ajuste  avail! 
que  les  commencements  MT  pour  obtenir  de  meillcures  traductions  automatiques  ("pre-editant ").  et  ou  un  texte  traduit  par 
ordinatcur  peut  etre  post-edite  par  les  traducteurs  humains  pour  obtenir  la  qualite  plus  elevee 

II  V  a  beaucoup  de  svstemes  sur  MT  et  du  CHAT  sur  !c  marchc  affirmant  d’etre  I'outil  pratique  droit,  et  le  choix  est  difficile 
etre  fait  sans  avoir  des  entires  precis  pour  ia  decision. 

Puisque  la  quantitc  de  donnees  dans  les  dictionnaires  automatises  —  sur  la  longue  ligne  de  developpement  des  svstemes  MT  et 
de  CHAT  —  est  le  composant  dc;cisif.  la  mise  a  jour  du  dictionnaire  (elcctroniquc)  joue  un  role  substantiel  dans  les  deux 
solutions  de  rechange: 

—  Un  concept  peut  etre  —  particulierement  si  le  systeme  a  toujours  une  grande  capacite  d'entrees  de  dictionnaire  que 
I'utilisateur.  g  de  F...  le  traducteur  ne  devrait  pas  participer  a  un  tel  processus  (voir,  par  exemple  le  Systran-concept).  de  sorte 
qu'il  y  ait  des  specialistes  requis  du  cote  d'administration  de  systeme  p,.^r  faire  le  travail  d'  ameliorer  la  base  de  donnees  de 
dictionnaire: 

—  L'autre  maniere  est  la  laisser  a  I'utilisateur  professionnel  achever  le  dictionnaire  du  systeme  ou  ajouter  un  vocabulaire 
special  (voir,  comme  exemple.  le  logo-concept). 

II  y  a  d’autres  aspects  de  systeme  qui  jouent  un  role  important  sur  le  processus  de  decision:  la  disponibilite  des  couples  de 
langues.  la  possibility  de  cn  utilisant  ou  manipuler  des  types  de  texte  speciaux  (g  de  E„  minutes,  lettres,...). 


T-7 


Criteres  ((’evaluation 
Quality 

Meme  si  une  normalement  boite  (ou  doit)  manipuler  le  systeme  comme  “boite  noire",  la  est  des  differences  dans  la  qualite  des 
resultats  (crus)  “purs"  de  traduction  d’appareil  (la  production).  II  n'est  pas  tres  facile  de  donner  une  mesure  precise,  mais  il  y  a 
certains  criteres  importants  a  noter  (voir,  pour  les  details,  en  particulier.  le  concept  Van  SLYPE  ( 1 982)  et  les  descriptions  des 
deux  evaluations  SYSTRAN,  Van  SLYPE  ( 1 979)). 

Les  criteres  principaux  sous  ces  aspects  sont: 

(1)  fiabilite  et  fidelite,  ces  moyennes:  a  quel  degre  (mauvais,  bon  assez,  bon)  la  teneur/signification  du  texte  original  est 
conservee. . 

(2)  comprehensibilite  et  I’intellegibilite,  ces  moyennes:  a  quel  degre  1'utilisateur  final  peut  lire  et  comprendre  le  texte  traduit. 

Ceci  regarde  assez  simple,  mais  les  problemes  se  produisent  dans  le  detail:  Ainsi,  sur  la  syntaxe  et  le  niveau  stylistique,  un 
systeme  normalement  produira  les  traductions  qui  sont  pires  que  les  traductions  humaines,  tandis  que  sur  le  niveau  lexical, 
particulierement  en  identification  du  bon  term  (technique  s),  la  traduction  d'un  systeme  peut  meme  etre  plus  precise  et 
consistant  qu'une  traduction  humaine. 

Environnement  duplication 

MT  ou  un  CHAT  doit  etre  vu  dans  1'environnement  (technique)  des  possibilites  d’application.  Ainsi,  normalement  une 
decision  est  non  seulement  la  qualite  orientee,  mais  egalement  basee  sur  les  possibilites  de  l’integration  dans  un  texte  ou  un 
systeme  complexe  de  traitement  de  textes.  Sous  ces  aspects,  les  composants  suivants  doivent  etre  consideres: 

Integration  dans  les  bases  de  donnees  (bibliographiques  ou  textuelles) 

Entretemps,  les  bases  de  donnees  sont  —  techniquemcnt  parie  —  accessibles  world-wide  par  I’intermediaire  des  reseaux  et 
meme  satelite  de  la  communication  de  commutation  par  paquets.  Ainsi,  surmontant  les  barrieres  linguistiques.  g  de  E..  entre 
I’anglais  et  le  Japonais,  mais  particulierement  sur  le  marche  multilingue  europeen,  devient  un  desir  important.  Des  efforts 
experimentaux  pour  integrer  des  systemes  MT  dans  un  tel  processus  de  [’information  sont  faits  au  Japon:  comme  un  exemple  la 
base  de  donnees  inspec  qui  est  originellement  anglaise  peut  etre  interrogee  a vec  les  mots  cles  japonais.  Les  mots  cles  et  plus  tard 
(anglais)  le  titre  sont  automatiquement  traduits  pendant  le  dialogue  en  Japonais  (Nagao  et  autres.  1982).  D’une  maniere 
semblable,  un  systeme  MT  pour  1’allemand  a  I'anglais  est  employe  par  1'intermediaire  d’un  processus  en  lots  et  avec  la  post¬ 
edition)  pour  traduire  les  titres  des  bases  de  donnees  allemandes  (Zimmermann  et  autres  1 987). 

II  est  bien  evident  que  la  traduction  de  titres  et  d’abreges  pourrait  etre  egalement  faite  par  les  traducteurs  humains.  Mais  il  y  a 
certains  arguments  pour  MT  et  CHAT:  le  texte  pour  etre  traduit  est  lisible  sur  ordinateur.  tellement  la  est  une  base  ideale  pour 
utiliser  un  ordinateur,  les  domaines  ou  les  secteurs  sur  lesquels  en  rubrique/ texte  est  oriente  sont  normalement 
“physiquement"  marques,  de  sorte  que  la  classification  ou  meme  les  fonctions  de  thesaurus  puisse  etre  employee 
particulierement  pour  le  transfert  lexical  (la  desambiguisation).  les  besoins  techniques  de  vocabulaire  d’etre  tres  precis,  de 
sorte  que  I’  ordinateur  aide  en  etant  compatible. 

Indexation  automatique 

L’indexation  (le  complet)  du  texte  pourrait  etre  un  bon  elfet  secondaire  d'employer  MT  et  CHAT.  Pour  le  transfert  lexical,  on 
doit  deriver  des  formes  de  mot  aux  formes  de  base;  des  composes  et  des  mots  complexes  doivent  etre  identifies  en  tant  que  tel, 
exprimez  les  informations  de  classe,  meme  des  relations  entre  les  termes  sont  employees  pour  la  desambiguisation.  Ainsi  il  peut 
etre  considere  de  foumir  de  tels  resultats  de  production  ou  d’intermediaire  des  systemes  MT  aux  fins  de  l’archivage  de 
document  et  la  recherche  documentaire. 

Texte  et  traitement  de  textes 

II  n’y  a  aucun  doute  que  le  traitement  de  textes  joue  un  role  important  dans  chaque  environnement  de  traduction.  Meme  les 
traducteurs  (humains)  freelance  de  plus  en  plus  emploieront  un  systeme  de  traitement  de  textes  (sur  le  PC),  et  la  sont  une  petite 
etape  pour  integrer ,  de  quelque  fajon  que  ce  soit,  des  glossaires  ou  des  listes  (prives)  de  mot  accessible?  par  I’intermedia'rr  dc? 
soi-disant  “fenetres"  sur  l’ecran  —  au  lieu  d’uti'iscr  des  "’cites  d’earte  indiuc.  u  usi  bie.i  eviuent  que  d  autres  fonctions,  g  de  E.. 
les  controleurs  d’orthographe,  de  grammaire  et  de  style,  seront  de  plus  en  plus  integres  dans  un  tel  processus. 

En  consequence,  le  “texte  source”  est  lisible  sur  ordinateur,  mais  on  doit  adapter  MT  et  le  systeme  de  CHAT  (aux  differents) 
systemes  de  traitement  de  textes  (par  exemple  le  Wang  OIS  est  combine  avec  les  LOGOS  et  SYSTRAN,  WordPerfect  est 
combine  avec  SYSTRAN).  Si  de  tels  outils  sont  disponibles,  la  post-edition  des  resultats  MT  peut  etre  soutenue  par  le 
redacteur  special. 

Un  probleme  dans  cet  environnement  est  la  combinaison  de  ces  outils  avec  les  equipements  locaux  de  MT  (voir,  par  exemple, 
des  LOGOS)  ou  de  la  connexion  avec  un  centre  de  service  de  traduction  (voir,  par  exemple  ,  le  concept  de  1’appiication 
SYSTRAN  dans  la  Commission  europeenne  ou  meme  ('utilisation  du  minitel-systeme  en  France  obtenir  des  traductions 
automatiques  (par  SYSTRAN). 
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EXTRACT  FROM  UNEDITED  MACHINE  TRANSLATION  OF  PAPER  8. 

In  the  scientific  and  technical  literature,  especially  in  the  fields  of  high  technology,  the  terminology  is  moving  Research,  the 
industrial  sector  and  the  commercial  sector  are  confionted  with  a  discovered  news,  new  products,  therefore  with  new  names. 
The  lack  of  terminological  standardization  and  the  use  of  industrial  jargons  make  comprehension  even  more  difficult  not  only 
when  it  is  foreign  languages,  but  also  for  the  users  practising  the  language  of  the  document. 

The  information  science,  created  to  the  specialists'profit.  tends  to  cover  exhaustively  the  scientific  and  technical  literature 
available  in  the  world.  The  production  of  computerized  bases  and  data  banks  became  practical  current  and  in  fact  the  only 
solution  of  information  vis-a-vis  the  increasing  document  mass. 

Two  are  distinguished  kind  of  bases  and  data  banks:  those  which  are  of  universal  utility  (CHEMABS.  1NSPEC.  METADEX. 
NTIS.  etc)  and  those  which  are  worked  out  for  use  plus  restraint,  for  specific  industry,  some  undertakings  or  even  only  one 
institution.  But  whatever  they  are.  their  user  producers  and  run  up  against  the  language  and  terminology  problems. 

Standardization  terminological  itself,  as  welt  as  the  production  of  the  data  bases,  closely  dependent  on  the  communication 
between  the  document  producers  .  to  the  availability  of  these  documents,  to  the  description  of  those  and  to  the  organization  of 
coherent  information. 

The  availability  of  the  documents  is  problematic  for  well-known  reasons: 

—  industrial  concurence,  safety,  reason  of  state; 

—  difficulty  of  producers'  tracking  and  holders  of  documents  and  of  contact  w  ith  those. 

The  compilation  of  a  data  base  constitutes  a  first  step  towards  a  better  communication  and  towards  terminological 
standardization;  indeed,  the  producer  of  the  data  base  contacts  the  authors  of  the  documents,  gathers  these  documents  or  their 
description,  indexes  them  in  a  uniform  way.  creating  thus  a  more  or  less  homogeneous  terminology.  In  return,  the  users  of  the 
data  bases  familiarize  themselves  with  only  one  terminology  and  tend  to  use  it. 

The  universal  use  of  the  data  bases  by  users  of  various  countries  and  of  languages  various  accustoms  them  all  to  only  one 
language  and  only  one  terminology  (in  majority  English)  and  already  opens  the  breche  in  the  wall  of  linguistic 
incomprehension. 

It  is  advisable  here  to  stress  that  the  linguistic-  problems  are  different  from  a  country  to  another:  there  are  unilingtial  and 
multilingual  countries . 

1  anguages  also  can  be  divided  into  two  categories:  easy  said  languages:  Germanic  Latin  languages  and.  and  languages  that  one 
describes  as  difficult  to  reach:  languages  Slav  ic.  Chinese.  Japanese.  Arab,  etc);  of  these  last  only  will  be  retained  those  which 
convey  currently  a  significant  scientific  production,  such  as  for  example  Japanese  and  Russian.  When  one  differentiates  the 
problems  by  country  one  points  out  that  the  unilingual  countries  such  as  France.  Great  Britain,  the  USA  have  problems  with  all 
languages:  but  if  they  have  difficulties  of  comprehension,  on  the  other  hand  they  for  a  long  time  set  up  structures  and  created 
organisms  facilitating  the  translation  for  languages  of  both  groups.  The  figures  are  significant:  2d%  of  all  the  translations  carried 
out  concern  the  pairs  of  French-English  and  German-English  languages. 

The  multilingual  countries  such  as  Switzerland.  Canada.  Belgium  have  much  less  problems  of  comprehension  of  the  original 
texts  and  have  consequently  a  network  of  scientific  translation  weaker,  centred  especially  on  difficult  to  reach  languages. 

To  reduce  the  linguistic  barrier  do  not  mean  to  translate  systematically  the  texts  in  the  language  of  the  user  of  the  moment. 
When  one  speaks  about  the  translation  and  structure  network  facilitating  the  translation,  not  very  many  are  those  which  carry 
out  of  it  the  width  and  diversity.  It  is  however  of  primary  importance  that  the  information  users  and  the  specialists  in 
information  know  them. 

There  are  initially  the  integral  translations  of  documents  (cover-to-cover  translations)  that  one  publishes  especially  for  the 
difficult  to  reach  language  documents  (Russian.  Japanese).  There  is  also  reference  vorks  which  point  out  them. 

There  is  also  a  considerable  number  of  texts  which  are  already  translated  and  can  be  re-used  at  a  later  date  several  times.  For 
that  one  should  detect  them,  to  collect  them  and  to  compile  them  in  the  form  of  reference  or  data  bank  works  to  point  out  them 
to  the  potential  users.  The  translated  document  collection  comes  itself  unaeainst  the  already  evoked  difficulties  of  contpotiuui, 
of  safety. 

It  is  in  these  particular  cases  that  have  to  intervene  of  the  more  restricted  groupings  guaranteeing  control  of  the  users,  defence  of 
the  interests  of  each  one  inside  the  grouping  as  well  as  professional  secrecy  for  the  translation  suppliers  and  their  users.  It  is  only 
in  this  context,  by  respecting  the  evoked  factors  that  it  is  possible  to  gather  the  translations  already  carried  out. 

It  goes  without  saying  that  the  data  base  is  the  most  valid  solution  to  put  these  translations  at  the  disposal  of  the  interested 
parties.  For  example  it  is  advisable  to  quote  "World  Translation  index",  data  base  which  proposes  the  translations  gathered  in 
Europe  and  in  the  USA  by  the  translation  national  centres  (USA)  and  the  international  translation  centres  (Europe).  WT1  is 
also  diffused  in  reference  traditional  work  form. 


Certain  groupings  point  out  for  their  members  the  translations  carried  out  within  the  group,  for  example  the  metallurgical 
research  centre,  Belgium. 

As  a  last  resort  to  abolish  the  linguistic  barrier  there  is  obviously  the  translation.  One  distinguishes: 

—  traditional  translation. 

—  translation  carried  out  by  computer; 

—  computer-assisted  translation. 

The  method  of  traditional  translation  called  upon  the  translators  and  presents  certain  difficulties,  especially  for  the  very 
specialized  technical  texts:  the  translator  encounters  problems  of  technical  comprehension  and  terminology. 

The  translation  by  computer  made  major  progress,  but  its  effectiveness  and  the  quality  of  the  translation  is  still  debatable. 
Multiple  studies  spoke  about  it,  evaluating  its  advantages  and  its  defects.  It  is  considered  however  that  these  translations  are 
valid  at  least  for  the  scientists'  information  and  provided  that  the  vocabulary  input  into  the  computer  is  limited  to  a  properly 
determined  field.  It  is  useful  to  stress  the  positive  result  obtained  for  the  indexdation  of  the  documents  and  the  preparation  of 
the  summaries  to  introduce  into  the  data  bases.  The  standard  or  repetitive  texts  are  also  rather  well  translated  .  The  principal 
advantage  of  the  translation  by  computer  is  indisputably  the  possibility  of  standardization  of  the  terminology.  The 
phenomenon  of  standardization  caused  by  the  data  bases  for  language  appears  here  for  several  languages.  Unfortunately  the 
essential  reproaches  that  is  generally  formulated  are  considerable:  price  too  high,  considerable  waste  of  time  in  revision  of  the 
texts,  inaptitude  of  the  computer  to  take  into  consideration  the  nuances  of  the  text. 

Computer-assisted  translation,  as  for  more  flexible  it,  combine  the  creative  the  translator's  work  and  the  possibilities  of  data 
processing.  It  is  for  that  that  it  presents  an  advantageous,  valid,  realizable  solution  without  excessive  expenses.  Its  advantages 
are  obvious  for  the  user  of  the  texts  and  for  the  translator,  the  output  and  the  quality  of  the  translation  are  improved. 

It  is  noted  that  the  possibilities  of  reduction  of  the  linguistic  barrier  are  numerous;  it  will  be  therefore  useful  to  know  the 
institutions  which  implement  them. 

There  are  the  translation  and  diffusion  organisms  above  all  of  the  bases  and  data  banks: 

—  base  producers  and  data  banks; 

—  information  centres; 

—  description  centres  of  the  translations; 

—  terminological  centres. 

Organisms  contact  the  document  producers,  compile  the  literature,  put  information  at  the  disposal  of  the  user,  gather  the 
existing  translations,  produce  and  exploit  computerized  terminological  dictionaries  They  arc  useful  to  the  user  since  they 
answer  in  various  ways  their  requirements  in  information  and  abolish  the  linguistic  barrier.  They  are  also  the  auxilliaires  of  the 
translator  since  they  are  implied  in  the  production  and  the  exploitation  of  the  terminological  sources  and  of  the  sources  of 
information  on  the  existing  translations. 

Information  user  confronted  with  the  linguistic  problems  and  the  translator  who  has  to  help  it  has  both  the  major  interest  to 
know  them  and  to  resort  to  their  services. 

Nevertheless,  when  it  is  a  question  of  abolishing  the  barrier  linguistic  he  is  well  the  translator  who  is  personnage-cle  it.  These 
problems  deserve  consequently  a  detailed  study. 

More  than  ever  the  translator  finds  himself  confronted  with  very  specialized  scientific  texts  and  with  descriptions  of  high 
technicality.  He  is  compelled  to  cope  with  the  problems: 

—  comprehension  of  the  texts; 

—  terminological  research  for  advanced  techniques; 

—  research  of  the  sources  of  information; 

—  an  adaptation  to  the  modem  work  and  information  retrieval  methods. 

The  comprehension  of  the  texts  is  made  difficult  by  the  complexity  of  the  subjects  which  for  55  %  raise  field  of  precise  sciences 
and  of  the  high  technology;  the  very  pushed  specialization  of  the  disciplines  implied  in  each  subject  insists  on  a  certain  scientific 
training  or  at  least  a  close  contact  with  the  specialist. 

The  research  of  the  equivalent  terms  in  the  source  language  and  in  the  target  language  is  made  extremement  difficult  because  of 
mouvance  ot  the  terminology  of  the  lack  of  standardization  in  the  definitions  of  new  products  and  methods,  industrial  jargons: 
the  translator  devotes  on  average  1 7  %  of  his  time  in  the  search  for  information  and  58  %  only  to  the  translation  as  such. 

The  dictionaries  —  traditional  the  translator’s  tools  —  present  a  considerable  shift  in  relation  to  the  proposed  texts,  especially  in 
the  fields  mentioned  and  in  advanced  disciplines. 

The  petrol  even  of  the  dictionary  and  the  edition  methods  does  not  allow  any  sufficiently  rapid  update;  it  is  generally 
considered  that  they  are  4  years  of  delay  on  the  technological  developments. 


BARRIERS  TO  INFORMATION  TRANSFER  AND  APPROACHES  TOWARD 
THEIR  REDUCTION 
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